Research Article
Handling Big Data Scalability in Biological Domain Using Parallel and Distributed Processing: A Case of Three Biological Semantic Similarity Measures
Table 14
Total time obtained using a distributed system (2, 3, and 4 slaves) with Enhanced Resnik and input data divided by their similarity.
| Number of Gene Pairs | Original Resnik Total Time (ns) | Threaded Resnik Total Time (ns) (Input Data Divided by Their Similarity) | % Threaded Resnik Total Time (Input Data Divided by Their Similarity) vs. Original Resnik Total Time | 2 Slaves | 3 Slaves | 4 Slaves | 2 Slaves | 3 Slaves | 4 Slaves |
| 10 | 2560906085 | 335066219 | 211190589 | 3208745299 | -86.92 | -91.75 | 25.30 | 100 | 5350898201 | 854315104 | 794283629 | 711803925 | -84.03 | -85.16 | -86.70 | 1000 | 5224382582 | 8390205103 | 7467893056 | 8870113085 | 60.60 | 42.94 | 69.78 | 10000 | 2997898214 | 93562482698 | 80426435081 | 68009514911 | 3020.94 | 2582.76 | 2168.57 | 100000 | 9417548254 | 5.81006E+11 | 6.31117E+11 | 6.47021E+11 | 6069.40 | 6601.50 | 6770.38 | 1000000 | 46988654302 | 6.55475E+12 | 6.35691E+12 | 6.12695E+12 | 13849.64 | 13428.62 | 12939.22 | Average | 12090047940 | 1.20648E+12 | 1.17949E+12 | 1.14246E+12 | 3804.94 | 3746.49 | 3647.76 |
|
|