Research Article

Effective Preprocessing and Normalization Techniques for COVID-19 Twitter Streams with POS Tagging via Lightweight Hidden Markov Model

Table 2

Conventional text normalization methods and techniques.

TechniqueAbbreviationsRepeated charactersMisspelled words

Regular expressionXX
Replace() function using WordNetX
Expanding abbreviations by CSV file replacementXX
Probability model using edit distanceX
Spell correction using TextBlobXX
NLTK libraryX
Phonetic edit distanceX
PyEnchant libraryX