Research Article

Handling Big Data Scalability in Biological Domain Using Parallel and Distributed Processing: A Case of Three Biological Semantic Similarity Measures

Table 21

Average time reduction obtained using Threaded SSDD in a distributed system when the input data are divided by similarity, compared with input data divided equally.

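| Number of Gene Pairs | IP (%), 2 Slaves | IP (%), 3 Slaves | IP (%), 4 Slaves |
|---|---|---|---|
| 10 | -20 | -84.42 | -89.55 |
| 100 | -31.74 | -74.06 | -84.86 |
| 1000 | -29.89 | -74.43 | -83.98 |
| 10000 | -61.79 | -77.92 | -72.71 |
| 100000 | -36.34 | -17.06 | -29.92 |
| 1000000 | 35.22 | 92.79 | -38.32 |
| Average | -2.41E+01 | -3.92E+01 | -6.66E+01 |

IP: Improvement Percentage.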
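The Average row of Table 21 can be checked by taking the arithmetic mean of each slave-count column over the six input sizes. The following is a minimal sketch of that check, assuming the Average row is a plain unweighted mean; the names `ip_by_pairs` and `column_means` are illustrative and do not come from the article.

```python
# Improvement Percentage (IP) values transcribed from Table 21.
# Keys: number of gene pairs; values: IP for 2, 3 and 4 slaves.
ip_by_pairs = {
    10: (-20.0, -84.42, -89.55),
    100: (-31.74, -74.06, -84.86),
    1000: (-29.89, -74.43, -83.98),
    10000: (-61.79, -77.92, -72.71),
    100000: (-36.34, -17.06, -29.92),
    1000000: (35.22, 92.79, -38.32),
}


def column_means(rows):
    """Return the unweighted mean of each column (assumed averaging rule)."""
    columns = zip(*rows.values())
    return [sum(col) / len(col) for col in columns]


means = column_means(ip_by_pairs)
for label, mean in zip(("2 Slaves", "3 Slaves", "4 Slaves"), means):
    print(f"{label}: {mean:.2f}")
```

Under this assumption the column means come out at roughly -24.09, -39.18 and -66.56, which agree to three significant figures with the reported -2.41E+01, -3.92E+01 and -6.66E+01.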