Research Article

Accurate Counting Bloom Filters for Large-Scale Data Processing

Table 1

Reduce-side join performance comparisons in Hadoop.

Filter parameters / = 10, False positive probabilityMap inputs (MB)Map outputs (MB)Filter construction times (s)Total execution times (s)

Join + CBF0.01772252.545.162115
Join + ACB 0.00491252.529.86892