Research Article
Identification of Potential Type II Diabetes in a Large-Scale Chinese Population Using a Systematic Machine Learning Framework
| Dataset | Sample distribution | Ratio | Description |
| --- | --- | --- | --- |
| Original data | 510,411/72,027 | 7 : 1 | Original data with full instances |
| SMOTE data | 510,411/510,411 | 1 : 1 | Dataset is balanced utilizing SMOTE oversampling |
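To illustrate how SMOTE oversampling can turn a roughly 7 : 1 class distribution into a 1 : 1 one, the following is a minimal sketch of the core interpolation step in plain Python. It is not the implementation used in the study: the function name `smote_oversample`, the neighbour count `k=5`, the seed, and the toy data are all assumptions made for illustration. For the counts in the table, balancing would require 510,411 − 72,027 = 438,384 synthetic instances of the smaller class.

```python
import math
import random


def smote_oversample(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points, each interpolated between a randomly
    chosen minority sample and one of its k nearest minority neighbours.

    Hypothetical helper for illustration; k and seed are assumed values.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment, in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Toy data (made up): 21 majority vs. 3 minority instances, i.e. 7 : 1.
majority_count = 21
minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1)]

new_points = smote_oversample(minority, majority_count - len(minority))
balanced_minority = minority + new_points  # now 21 instances, i.e. 1 : 1
```

Because each synthetic point lies on a line segment between two existing minority samples, the oversampled class stays within the region already covered by the real minority data instead of duplicating instances exactly.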