Research Article

Identification of Potential Type II Diabetes in a Large-Scale Chinese Population Using a Systematic Machine Learning Framework

Table 3

Dataset description.

DatasetSample distributionRatioDescription

Original data510,411/72,0277 : 1Original data with full instances
SMOTE data510,411/510,4111 : 1Dataset is balanced utilizing SMOTE oversampling