Research Article

Handling Data Skew in MapReduce Cluster by Using Partition Tuning

Table 4

Dataset characteristics.

Attribute characteristicTransaction
size (GB)
Numbers
of instances
Numbers
of attributes

Categorical, integer3.314,905,142247