Research Article

A New Robust Classifier on Noise Domains: Bagging of Credal C4.5 Trees

Table 2

Data set description. Column “N” is the number of instances in the data sets, column “Feat” is the number of features or attribute variables, column “Num” is the number of numerical variables, column “Nom” is the number of nominal variables, column “k” is the number of cases or states of the class variable (always a nominal variable), and column “Range” is the range of states of the nominal variables of each data set.

Data set NFeatNumNomkRange

anneal8983863262–10
arrhythmia45227920673162
audiology22669069242–6
autos20525151072–22
balance-scale6254403
breast-cancer28690922–13
wisconsin-breast-cancer6999902
car172860643-4
cmc147392732–4
horse-colic3682271522–6
credit-rating690156922–14
german-credit10002071322–11
dermatology3663413362–4
pima-diabetes7688802
ecoli3667707
Glass2149907
haberman306321212
cleveland-14-heart-disease303136752–14
hungarian-14-heart-disease294136752–14
heart-statlog270131302
hepatitis1551941522
hypothyroid37723072342–4
ionosphere351353502
iris1504403
kr-vs-kp31963603622-3
letter200001616026
liver-disorders3456602
lymphography1461831542–8
mfeat-pixel20002400240104–6
nursery1296080842–4
optdigits56206464010
page-blocks5473101005
pendigits109921616010
primary-tumor33917017212-3
segment2310191607
sick37722972222
solar-flare21066120632–8
sonar208606002
soybean68335035192–7
spambase4601575702
spectrometer5311011001484
splice31906006034–6
Sponge764404432–9
tae15153232
vehicle946181804
vote4351601622
vowel99011101112
waveform5000404003
wine178131303
zoo1011611672