Research Article

Handling Big Data Scalability in Biological Domain Using Parallel and Distributed Processing: A Case of Three Biological Semantic Similarity Measures

Table 22

Total time reduction obtained using Threaded SSDD in a distributed system with input data divided by their similarity versus input data divided equally.

Number of Gene PairsImprovement Percentage (IP)
2 Slaves3 Slaves4 Slaves

1037.74-73.17267.90
10028.81-53.069.45
100035.0892.8989.00
1000024.7958.41157.26
10000062.0737.7136.95
100000030.7058.37-10.80
Average36.5320.2091.63