Research Article

Biomedical Text Categorization Based on Ensemble Pruning and Optimized Topic Modelling

Table 7

Comparison of the proposed text categorization scheme with conventional classifiers, ensemble learners, and ensemble pruning method (with BA-LDA (DB) based representation).

Classification algorithmoh5oh10oh15ohscalohsumed

NB87.6781.4287.4483.6447.09
SVM88.9782.2288.1685.3250.08
Bagging+NB89.3283.3588.8783.4748.52
Bagging+SVM88.0384.8487.8683.9250.73
AdaBoost+NB89.7783.6087.4886.1851.18
AdaBoost+SVM88.1884.9587.3586.2951.85
RandomSubspace+NB88.3283.9686.6688.0950.70
RandomSubspace+SVM88.5684.1189.5888.2950.29
Stacking88.2886.8788.9384.9053.84
ESM88.5886.6690.2588.4851.94
BES89.2986.0090.9889.1252.47
LibD3C90.3587.9591.2790.4853.41
CDM91.5189.6193.1791.3354.47
Proposed scheme93.1491.2993.7692.1458.17

NB: Naïve Bayes algorithm, SVM: support vector machines, ESM: ensemble selection from libraries of models, BES: Bagging ensemble selection, LibD3C: hybrid ensemble pruning based on k-means and dynamic selection, and CDM: ensemble pruning based on combined diversity measures.