Research Article

MapReduce Based Personalized Locality Sensitive Hashing for Similarity Joins on Large Scale Data

Table 1

An illustrative example of similarity joins based on Jaccard similarity. 0/1 indicates absence/presence of features in each instance.

Instance Feature

A 0 1 0 0 1 0
B 1 0 0 0 1 1
C 0 1 0 1 1 0
D 0 0 1 1 0 0
E 0 0 0 1 0 1