Research Article

Handling Big Data Scalability in Biological Domain Using Parallel and Distributed Processing: A Case of Three Biological Semantic Similarity Measures

Table 19

Average time reduction obtained using Threaded Resnik with a distributed system and input data divided by their similarity versus input data divided equally.

Number of Gene PairsImprovement Percentage (IP)
2 Slaves3 Slaves4 Slaves

10-99.964.05-95.71
100-3.36-52.76-94.75
100010.12200-94.72
10000957.291999.38-94.62
100000258.33183.48-98.46
1000000417.40232.39-73.99
Average256.6445588.22-92.04