Research Article

Efficient and Scalable Graph Similarity Joins in MapReduce

Table 3

Dataset statistics.

Dataset Disk size (GB)

Enamine 1,000,000 52.32 50.37 0.7