Research Article

[Retracted] The Use of Hellinger Distance Undersampling Model to Improve the Classification of Disease Class in Imbalanced Medical Datasets

Table 9

The average results using classifiers (KNN, SVM, and DT) with models (baseline, Tml, RUS, ENN, and HDUS) on four datasets.

BaselineTmlRUSENNHDUS

KNNSensitivity (%)30.7036.5558.3747.0262.28
Specificity (%)88.5481.1865.1274.3465.39
Precision (%)37.5433.4738.6037.2436.88
F1_m (%)33.5934.9246.0640.8245.76
Bacc (%)59.3758.8761.7760.6863.84

SVMSensitivity (%)7.0333.6167.6342.5773.38
Specificity (%)96.6688.7160.0180.6260.36
Precision (%)20.0637.8638.4938.7339.10
F1_m (%)10.3435.1948.0339.9849.85
Bacc (%)51.8561.1663.9461.6966.87

DTSensitivity (%)39.8241.9155.2555.0482.91
Specificity (%)75.3470.5154.0265.8957.44
Precision (%)36.6033.6130.1036.6641.65
F1_m (%)37.8636.2338.6043.5954.79
Bacc (%)57.5856.2154.6460.4770.17