Research Article

Effective Preprocessing and Normalization Techniques for COVID-19 Twitter Streams with POS Tagging via Lightweight Hidden Markov Model

Table 6

Preprocessing techniques and method used.

Preprocessing techniquesMethods deployed

Stop–word removalRainbow list
StemmingSnowball stemmer
EmoticonRegular expression
TokenizationUnigram, bigram, and -gram
Weighting schemeTF-IDF