Research Article
Handling Big Data Scalability in Biological Domain Using Parallel and Distributed Processing: A Case of Three Biological Semantic Similarity Measures
Table 15
Average time obtained using a distributed system (2, 3, and 4 slaves) with Enhanced SSDD and input data divided by their similarity.
| Number of Gene Pairs | Original SSDD Average Time (ns) | Threaded SSDD Average Time (ns) (Input Data Divided by Their Similarity) | % Threaded SSDD Average Time (Input Data Divided by Their Similarity) vs. Original SSDD Average Time | 2 Slaves | 3 Slaves | 4 Slaves | 2 Slaves | 3 Slaves | 4 Slaves |
| 10 | 2.92E+08 | 1.24E+08 | 8.63E+10 | 1.17E+11 | -57.55 | 29444.68 | 39954.78 | 100 | 1.32E+08 | 5.72E+07 | 1.04E+10 | 1.06E+10 | -56.58 | 7794.49 | 7946.31 | 1000 | 9.32E+07 | 4.55E+07 | 1.01E+09 | 1.08E+09 | -51.16 | 984.15 | 1059.29 | 10000 | 4.62E+07 | 4.47E+07 | 9.67E+07 | 1.82E+08 | -3.23 | 109.35 | 294.01 | 100000 | 4.48E+07 | 2.47E+07 | 6.66E+07 | 4.38E+07 | -44.83 | 48.75 | -2.17 | 1000000 | 2.83E+07 | 6.45E+07 | 6.42E+07 | 4.12E+07 | 128.07 | 127.01 | 45.68 | Average | 1.06E+08 | 6.01E+07 | 1.63E+10 | 2.15E+10 | -1.42E+01 | 6.42E+03 | 8.22E+03 |
|
|