Research Article
Biomedical Text Categorization Based on Ensemble Pruning and Optimized Topic Modelling
Table 1
Descriptive information for the datasets.
| Dataset | Number of documents | Number of terms | Average occurrence of terms | Number of classes |
| --- | --- | --- | --- | --- |
| Oh5 | 918 | 3013 | 54.43 | 10 |
| Oh10 | 1050 | 3239 | 55.63 | 10 |
| Oh15 | 3101 | 54142 | 17.46 | 10 |
| Ohscal | 11162 | 11466 | 60.38 | 10 |
| Ohsumed-400 | 9200 | 13512 | 55.14 | 12 |