Research Article

Predicting Coronavirus Pandemic in Real-Time Using Machine Learning and Big Data Streaming System

Table 5

The performance of the RF model.

Feature extraction methodMatrix sizeTesting performanceCross-validation performance
AccuracyPrecisionRecallF1-scoreAccuracyPrecisionRecallF1-score

Unigram100083.3684.6983.3682.7386.43 ± 0.4887.41 ± 0.4686.37 ± 0.4986.02 ± 0.57
300084.7185.884.7184.0689.56 ± 0.3490.05 ± 0.4689.62 ± 0.3489.3 ± 0.48
Bigram100083.0584.4183.0582.3885.79 ± 0.5186.87 ± 0.5285.81 ± 0.4985.4 ± 0.61
300084.785.8184.784.0989.48 ± 0.3689.79 ± 0.4589.44 ± 0.3589.12 ± 0.45
Trigram100083.1184.4983.1182.4585.81 ± 0.4686.93 ± 0.4985.76 ± 0.4185.4 ± 0.44
300084.6785.8284.6784.0489.39 ± 0.4489.9 ± 0.3589.48 ± 0.3989.2 ± 0.33
Four-gram100083.0784.4983.0782.4685.29 ± 0.5186.62 ± 0.5785.34 ± 0.5784.96 ± 0.56
300084.6185.8284.618489.41 ± 0.4489.85 ± 0.4189.4 ± 0.3989.11 ± 0.41