Research Article

Semi-Supervised Predictive Clustering Trees for (Hierarchical) Multi-Label Classification

Table 2

MLC datasets and their characteristics.

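| Dataset | Domain | Examples | Descriptive variables (nominal/continuous) | Labels | Avg. labels per example |
|---|---|---|---|---|---|
| Bibtex [30] | Text | 7395 | 1836/0 | 159 | 2.402 |
| Birds [31] | Audio | 645 | 2/258 | 19 | 1.014 |
| Emotions [32] | Music | 594 | 0/72 | 6 | 1.869 |
| Corel5k [33] | Images | 5000 | 499/0 | 374 | 3.522 |
| Enron [34] | Text | 1702 | 1001/0 | 53 | 3.378 |
| Genbase [35] | Text | 662 | 1186/0 | 27 | 1.252 |
| Mediana [36] | Media | 7953 | 21/58 | 5 | 1.205 |
| Medical [37] | Text | 978 | 1449/0 | 45 | 1.245 |
| Scene [38] | Images | 2407 | 0/294 | 6 | 1.074 |
| SIGMEA real [39] | Ecology | 817 | 0/4 | 2 | 0.726 |
| Slovenian rivers [40] | Ecology | 1060 | 0/16 | 14 | 5.073 |
| Yeast [41] | Biology | 2417 | 0/103 | 14 | 4.237 |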

The numeric columns give, in order, the number of examples, the number of descriptive variables (nominal/continuous), the number of labels, and the average number of labels per example.
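To make the last characteristic concrete, the average number of labels per example can be computed as the total number of assigned labels divided by the number of examples. The following is a minimal sketch, assuming the labels are stored as a binary (0/1) matrix with one row per example and one column per label; the function name `label_cardinality` and the toy data are hypothetical and are not taken from the article or from any dataset in Table 2.

```python
from typing import Sequence


def label_cardinality(label_matrix: Sequence[Sequence[int]]) -> float:
    """Return the average number of labels per example.

    Assumes a 0/1 matrix: one row per example, one column per label.
    """
    if not label_matrix:
        raise ValueError("label_matrix must contain at least one example")
    # Count every assigned label across all examples.
    total_labels = sum(sum(row) for row in label_matrix)
    return total_labels / len(label_matrix)


# Hypothetical toy data: 4 examples, 3 possible labels.
toy_labels = [
    [1, 0, 1],  # two labels
    [0, 0, 0],  # no labels
    [1, 1, 1],  # three labels
    [0, 1, 0],  # one label
]

cardinality = label_cardinality(toy_labels)
print(cardinality)  # 6 assigned labels over 4 examples
```

Because each example contributes a non-negative integer count, an average below 1 (as reported for SIGMEA real) implies that at least some examples carry no labels at all.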