Research Article

[Retracted] The Use of Hellinger Distance Undersampling Model to Improve the Classification of Disease Class in Imbalanced Medical Datasets

Table 3

The features of THS dataset.

No.Attribute nameData type

1DiagnosisCategorical
2Forced vital capacityNumeric
3A volume that has been exhaled at the end of the first of forced expirationNumeric
4Performance statusCategorical
5Pain before surgeryCategorical
6Hemoptysis before surgeryCategorical
7Dyspnoea before surgeryCategorical
8Cough before surgeryCategorical
9Weakness before surgeryCategorical
10Size of the original tumourCategorical
11Type 2 diabetes mellitusCategorical
12Myocardial infarction up to six monthsCategorical
13Peripheral arterial diseasesCategorical
14SmokingCategorical
15AsthmaCategorical
16Age at surgeryNumeric
17One-year survival period (class)Categorical