Research Article

[Retracted] The Use of Hellinger Distance Undersampling Model to Improve the Classification of Disease Class in Imbalanced Medical Datasets

Table 6

Evaluation results for PIMA dataset using classifiers (KNN, SVM, and DT) and models (baseline, Tml, RUS, ENN, and HDUS).

PIMABaselineTmlRUSENNHDUS

KNNSensitivity (%)60.0664.5270.7770.5783.87
Specificity (%)83.858073.8574.9262.50
Precision (%)63.1656.6156.4156.4650.35
F1_m (%)61.5760.3162.7862.7362.92
Bacc (%)70.9672.2672.4172.7573.19

SVMSensitivity (%)066.1374.427479.03
Specificity (%)10083.8573.8576.9271.50
Precision (%)058.1358.5458.0457.00
F1_m (%)061.8765.5365.0666.23
Bacc (%)5074.9974.6375.8675.27

DTSensitivity (%)61.2969.3570.9770.9791.90
Specificity (%)79.2366.9258.4668.4666.77
Precision (%)58.465044.950.7656.98
F1_m (%)59.8458.115559.1970.34
Bacc (%)70.2668.1464.7169.7179.34

AVGSensitivity (%)39.7866.6772.0571.8584.93
Specificity (%)87.6976.9268.7273.4366.92
Precision (%)40.5454.9153.2854.0954.78
F1_m (%)40.1660.2261.2661.7266.50
Bacc (%)63.7471.870.5872.7775.93