Research Article

Handling Big Data Scalability in Biological Domain Using Parallel and Distributed Processing: A Case of Three Biological Semantic Similarity Measures

Table 20

Total time reduction obtained using Threaded Resnik on a distributed system when the input data are divided by their similarity, compared with input data divided equally.

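| Number of Gene Pairs | Improvement Percentage (IP), 2 Slaves | Improvement Percentage (IP), 3 Slaves | Improvement Percentage (IP), 4 Slaves |
|---|---|---|---|
| 10 | -11.79 | -28.59 | 1446.61 |
| 100 | 263.75 | 342.67 | 76.45 |
| 1000 | 1436.49 | 2081.14 | 2021.31 |
| 10000 | 7012.94 | 15755.41 | 16546.66 |
| 100000 | 17116.60 | 19291.13 | 17176.28 |
| 1000000 | 29539.75 | 32964.86 | 44558.83 |
| Average | 9226.29 | 11712.50 | 13637.69 |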