Research Article
Biomedical Text Categorization Based on Ensemble Pruning and Optimized Topic Modelling
Box 1
The generative process of LDA (Blei et al., 2013; [19, 20]).
For each document in a corpus D:
1. Choose N ∼ Poisson(ξ).
2. Choose Θ ∼ Dir(α).
3. For each of the N words w_n:
   (a) Choose a topic z_n ∼ Multinomial(Θ).
   (b) Choose a word w_n from p(w_n | z_n, β), a multinomial probability conditioned on the topic z_n.
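The generative process in Box 1 can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' implementation: the function name `generate_corpus` and the parameters `n_docs` and `seed` are hypothetical, and β is assumed to be a K × V matrix whose row k holds the word distribution of topic k.

```python
import numpy as np


def generate_corpus(n_docs, alpha, beta, xi, seed=0):
    """Sample documents following the LDA generative process of Box 1.

    alpha : length-K Dirichlet parameter (assumed layout).
    beta  : K x V matrix, row k = word distribution of topic k (assumed layout).
    xi    : Poisson mean for the document length.
    Returns a list of (topics, words) pairs, one per document.
    """
    rng = np.random.default_rng(seed)
    alpha = np.asarray(alpha, dtype=float)
    beta = np.asarray(beta, dtype=float)
    n_topics, vocab_size = beta.shape

    corpus = []
    for _ in range(n_docs):
        n_words = rng.poisson(xi)        # Choose N ~ Poisson(xi)
        theta = rng.dirichlet(alpha)     # Choose Theta ~ Dir(alpha)
        topics, words = [], []
        for _ in range(n_words):
            # (a) Choose a topic z_n ~ Multinomial(Theta)
            z = int(rng.choice(n_topics, p=theta))
            # (b) Choose a word w_n from p(w_n | z_n, beta)
            w = int(rng.choice(vocab_size, p=beta[z]))
            topics.append(z)
            words.append(w)
        corpus.append((topics, words))
    return corpus
```

For example, `generate_corpus(5, alpha=[0.5, 0.5], beta=[[0.5, 0.5, 0.0, 0.0], [0.0, 0.0, 0.5, 0.5]], xi=8)` draws five documents over a four-word vocabulary in which each topic only emits words from its own half of the vocabulary. Words are returned as integer vocabulary indices; mapping them to actual terms is left to the caller.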