Research Article

A Gamma-Poisson Mixture Topic Model for Short Text

Table 1

Notation.

Description

Number of documents in the corpus
Size of the vocabulary
Number of topics
Length of an individual document
Collection of documents
Word-frequency vector of an individual document
Number of times a given word occurs in a given document
Vector of topic assignments of each document
Topic assignment of an individual document
Number of documents assigned to a given topic
Number of times a given word is observed in a given topic
Number of words in a given topic
For any quantity that describes a characteristic of the corpus, the corresponding quantity computed over the corpus with one given document excluded
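
As a rough illustration of the quantities described in Table 1, the following minimal sketch computes them for a toy corpus. The data, the topic assignments, and all variable and function names (docs, z, topic_counts, and so on) are hypothetical and chosen only for illustration; they are not the article's symbols. The sketch assumes each document carries a single topic assignment, as the table's "topic assignment of document" entry suggests.

```python
from collections import Counter

# Toy corpus: each document is a list of word tokens (illustrative data only).
docs = [
    ["cat", "dog", "cat"],
    ["stock", "market"],
    ["dog", "pet"],
]
# Hypothetical topic assignments, one topic per document.
z = [0, 1, 0]
num_topics = 2

vocab = sorted({w for doc in docs for w in doc})
num_docs = len(docs)                          # number of documents in the corpus
vocab_size = len(vocab)                       # size of the vocabulary
doc_lengths = [len(doc) for doc in docs]      # length of each document
doc_freqs = [Counter(doc) for doc in docs]    # word-frequency vector of each document


def topic_counts(docs, z, num_topics, exclude=None):
    """Return per-topic counts, optionally leaving out the document at index `exclude`.

    Returns a tuple of:
      - number of documents in each topic,
      - a Counter of word occurrences for each topic,
      - total number of words in each topic.
    """
    docs_in_topic = [0] * num_topics
    word_in_topic = [Counter() for _ in range(num_topics)]
    words_in_topic = [0] * num_topics
    for i, (doc, k) in enumerate(zip(docs, z)):
        if i == exclude:
            continue  # skip the excluded document
        docs_in_topic[k] += 1
        word_in_topic[k].update(doc)
        words_in_topic[k] += len(doc)
    return docs_in_topic, word_in_topic, words_in_topic


# Counts over the full corpus.
m, n_w, n = topic_counts(docs, z, num_topics)
# The same counts with the first document excluded.
m_ex, n_w_ex, n_ex = topic_counts(docs, z, num_topics, exclude=0)
```

Computing the "excluded" counts through the same function with an optional index keeps the two sets of quantities consistent by construction. A sampler would more likely update the counts incrementally instead of recomputing them.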