Research Article
Two-Stage Bagging Pruning for Reducing the Ensemble Size and Improving the Classification Performance
Table 1
List of 28 datasets from UCI Machine Learning Repository and their brief descriptions.
| Abbr. | Name of the dataset | #Ins. | #C | #V |
| Aba | Abalone | 4177 | 3 | 8 | Adult | Adult | 48842 | 2 | 14 | Aus | Australian Credit | 690 | 2 | 14 | Bcw | Breast cancer Wisconsin | 699 | 2 | 10 | Bld | Liver Disorders | 345 | 2 | 6 | Cmc | Contraceptive Method Choice | 1473 | 3 | 9 | Col | Horse Colic | 368 | 2 | 27 | Cre | Credit Approval | 690 | 2 | 15 | Der | Dermatology | 366 | 6 | 34 | Ger | German Credit | 1000 | 2 | 24 | Gla | Glass | 214 | 6 | 9 | Hea | Statlog(Heart) | 270 | 2 | 13 | Hep | Hepatitis | 155 | 2 | 19 | Ion | Ionosphere | 351 | 2 | 34 | Kr-vs-kp | Chess End-Game | 3196 | 2 | 36 | Mam | Mammographic Mass | 961 | 2 | 5 | Pid | Pima Indians Diabetes | 769 | 2 | 8 | Spe | SPECTF heart | 267 | 2 | 44 | Tel | MAGIC gamma telescope | 19020 | 2 | 10 | Veh | Vehicle Silhouettes | 846 | 4 | 18 | Vot | Congressional Voting Records | 435 | 2 | 16 | Vow | Vowel Recognition | 990 | 11 | 10 | Yea | Protein Localization Sites | 1484 | 10 | 8 | Spambase | SPAM E-MAIL | 4601 | 2 | 57 | Tictacto | Tic-Tac-Toe Endgame | 958 | 2 | 9 | Wdbc | Wisconsin Diagnostic Breast Cancer | 569 | 2 | 30 | Wpbc | Wisconsin Prognostic Breast Cancer | 198 | 2 | 31 | Spect | SPECT Heart | 267 | 2 | 22 |
|
|
Note. #Ins., #C, and #V mean the number of instances, the number of classes, and the number of variables for the dataset, respectively.
|