Research Article

Prediction of Breast Cancer from Imbalance Respect Using Cluster-Based Undersampling Method

Table 1

Experimental datasets.

DatasetsNo. of data samplesNo. of featuresImbalance ratio

Small-scale datasets
(1) Abalone731816.4
(2) Bcwo68391.8577
(3) Pima33682.027
(4) Redwine1837113.21
(5) Redwine2880113.42
(6) Redwine37341112.85
(7) Redwine46911112.04
(8) Wbcd569301.8
(9) Whitewine1043115.4
(10) Yeast170781.8975
(11) Yeast262682.840
(12) Yeast389281.08

Large-scale dataset
(1) Breast cancer10229411716319
(2) Protein homology prediction1457517411146