Research Article

Effective and Fast Near Duplicate Detection via Signature-Based Compression Metrics

Table 2

SigNCD versus the baselines for Gold Set.

Algorithms Prec. Rec. Runtime (ms)

SigNCD w/ P1 0.95 0.89 0.92 7838
SpotSigNCD w/ P1 0.97 0.87 0.92 10853
NCD 0.90 0.78 0.83 56829
SpotSigs 0.90 0.77 0.83 12824
Google’s simhash 0.85 0.45 0.59 13010
SL+ST 0.74 0.53 0.62 242383