Research Article

Efficient and Scalable Graph Similarity Joins in MapReduce

Algorithm 2

Replacement of filter of Algorithm 1.
(1) Map  
(2)   create SBF for each graph and insert the generated -gram signatures into it;
(3)   output or ;
(4) Shuffle conduct Cartesian product between pairs and pairs ;
(5) Reduce  
(6)   for any , calculate the intersection of and and estimate
     the number of common signatures;
(7)    invoke the count filtering for each pair and output it if it passes the condition;