Research Article

Predicting Coronavirus Pandemic in Real-Time Using Machine Learning and Big Data Streaming System

Table 3

The performance of the KNN model.

Feature extraction methodMatrix sizeTesting performanceCross-validation performance
AccuracyPrecisionRecallF1-scoreAccuracyPrecisionRecallF1-score

Unigram100062.1569.9462.1558.9965.75 ± 0.5273.33 ± 0.6565.75 ± 0.5263.5 ± 0.6
300063.7770.1163.7759.3668.36 ± 0.6174.72 ± 0.5468.36 ± 0.6165.7 ± 0.76
Bigram100062.9770.9662.9759.566.09 ± 0.5973.85 ± 0.8966.09 ± 0.5963.74 ± 0.76
300064.4971.0264.4959.9669.13 ± 0.7676.04 ± 0.5669.13 ± 0.7666.44 ± 0.97
Trigram10006370.726359.5766.08 ± 0.5473.69 ± 0.7166.08 ± 0.5463.75 ± 0.65
300064.5470.6964.5460.0769.07 ± 0.7575.61 ± 0.6669.07 ± 0.7566.39 ± 0.96
Four-gram100062.9371.2462.9359.5366.09 ± 0.6373.76 ± 0.9366.09 ± 0.6363.75 ± 0.8
300064.6271.0464.6260.0669.25 ± 0.8276.16 ± 0.6669.25 ± 0.8266.56 ± 1.05