Research Article

HSDP: A Hybrid Sampling Method for Imbalanced Big Data Based on Data Partition

Table 4

G-means values of KNN + various sampling methods.

Dataset       KNN     SMOTE + KNN  ADASYN + KNN  Borderline-SMOTE + KNN  HSDP (proposed) + KNN
Pima          0.6503  0.6673       0.6721        0.6635                  0.6932
Yeast3        0.7536  0.7786       0.7798        0.7824                  0.8034
Abalone19     0.3165  0.6028       0.6037        0.4366                  0.5437
Segment0      0.9154  0.9379       0.9440        0.9261                  0.9528
Page-blocks0  0.8153  0.8629       0.8601        0.8421                  0.8757
Glass5        0.7887  0.7857       0.7898        0.7890                  0.7715
Ecoli4        0.7012  0.7401       0.7928        0.7429                  0.8438
Haberman      0.4502  0.4810       0.5211        0.4983                  0.5201
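
For readers unfamiliar with the metric reported in Table 4, the following is a minimal sketch of how a G-mean score can be computed for a binary problem. It assumes the commonly used definition, the geometric mean of sensitivity (true positive rate) and specificity (true negative rate); this section does not restate the formula, so that definition is an assumption here. The function name `g_mean`, its `positive` parameter, and the toy labels are illustrative and are not taken from the article.

```python
import math


def g_mean(y_true, y_pred, positive=1):
    """Geometric mean of sensitivity and specificity for binary labels.

    Assumes the common definition G-mean = sqrt(TPR * TNR); `positive`
    marks the minority (positive) class label.
    """
    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    # Guard against an empty class to avoid division by zero.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    return math.sqrt(sensitivity * specificity)


# Toy imbalanced example (hypothetical labels): 2 positives, 8 negatives.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]
score = g_mean(y_true, y_pred)

# A classifier that never predicts the minority class scores zero.
all_negative = g_mean(y_true, [0] * len(y_true))
```

In the toy example, sensitivity is 1/2 and specificity is 7/8, so the score is the square root of 0.4375. Unlike plain accuracy, the metric drops to zero when the minority class is never predicted, which is why it is a common choice for comparing sampling methods on imbalanced data.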