Research Article

HSDP: A Hybrid Sampling Method for Imbalanced Big Data Based on Data Partition

Table 1

Dataset information.

DatasetSamplesAttributesClassesImbalance ratio

Pima7688{Negative, positive}1.87
Yeast314848{Negative, positive}8.1
Abalone1941748{Negative, positive}129.44
Segment0230819{Negative, positive}6.02
Page-blocks0547210{Negative, positive}8.79
Glass52149{Negative, positive}22.78
Ecoli43367{Negative, positive}15.8
Haberman3063{Negative: “1” other, positive: “2”}2.78