Research Article

A Novel Algorithm for Imbalance Data Classification Based on Neighborhood Hypergraph

Table 1

Data description.

Dataset Size Attribute Class label (minority : majority) Class distribution

Bupa 345 6C 01 : 02 145/200
Colic 368 7C 15N No : yes 136/232
Reprocessed 294 13C 01 : 00 106/188
Machine 209 7C Others : 2 74/135
Labor 57 8C 8N Bad : good 20/37
Tic 958 9N Negative : positive 332/626
Iris 150 4C Iris-virginica : others 50/100
Seed 210 7C 02 : others 70/140
Vc 310 6C Normal : Abnormal 100/210
Glass 214 9C 01, 02 : others 68/146
Haberman 306 3C 02 : 01 81/225
Transfusion 748 4C 01 : 00 178/570
Abalone (7 : 15) 494 7C 1N 15 : 07 103/391
Balance-scale 625 4C B : others 49/576
Abalone (9 : 18) 731 7C 1N 18 : 9 42/689
Yeast (POX : CTY) 483 8C POX : CYT 20/463
Car 1728 6N Good : others 69/1659
Yeast (ME2 : others) 1484 8C ME2 : others 51/1433

C: continuous, N: nominal.