Research Article

A Log-Based Anomaly Detection Method with Efficient Neighbor Searching and Automatic K Neighbor Selection

Table 4

Comparison of the effort of distance calculation.

DatasetsSizeLog linesEffort of distance calculation with traditional kNN
Average samples in bucketsMaximum samples in buckets

Liberty29.5 G266991013191,839,098
BGL1.207 G4747963949024
Thunderbird27.367 G21121219243,087,287
Spirit30.28 G27229896978360273
HDFS1.58 G11175629362793
Zookeeper10.4 M7438049124