Research Article
Prediction of Breast Cancer from Imbalance Respect Using Cluster-Based Undersampling Method
| Datasets | No. of data samples | No. of features | Imbalance ratio |
| Small-scale datasets | (1) Abalone | 731 | 8 | 16.4 | (2) Bcwo | 683 | 9 | 1.8577 | (3) Pima | 336 | 8 | 2.027 | (4) Redwine1 | 837 | 11 | 3.21 | (5) Redwine2 | 880 | 11 | 3.42 | (6) Redwine3 | 734 | 11 | 12.85 | (7) Redwine4 | 691 | 11 | 12.04 | (8) Wbcd | 569 | 30 | 1.8 | (9) Whitewine | 1043 | 11 | 5.4 | (10) Yeast1 | 707 | 8 | 1.8975 | (11) Yeast2 | 626 | 8 | 2.840 | (12) Yeast3 | 892 | 8 | 1.08 |
| Large-scale dataset | (1) Breast cancer | 102294 | 117 | 16319 | (2) Protein homology prediction | 145751 | 74 | 11146 |
|
|