Research Article
A Log-Based Anomaly Detection Method with Efficient Neighbor Searching and Automatic K Neighbor Selection
Table 4
Comparison of the effort of distance calculation.
| Datasets | Size | Log lines | Effort of distance calculation with traditional kNN | Average samples in buckets | Maximum samples in buckets |
| Liberty | 29.5 G | 266991013 | 191,839,098 | BGL | 1.207 G | 4747963 | 949024 | Thunderbird | 27.367 G | 211212192 | 43,087,287 | Spirit | 30.28 G | 272298969 | 78360273 | HDFS | 1.58 G | 11175629 | 362793 | Zookeeper | 10.4 M | 74380 | 49124 |
|
|