Research Article
Application of Data Mining Technology on Surveillance Report Data of HIV/AIDS High-Risk Group in Urumqi from 2009 to 2015
Table 5
Description of original data and balanced data.
| Dataset | Minority class | Majority class | Samples in total | Imbalance rate |
| MSM (original) | 377 | 4927 | 5304 | 13.0689 | MSM (SMOTE) | 4901 | 4976 | 9877 | 1.0153 | FSW (original) | 49 | 9041 | 9090 | 184.5102 | FSW (SMOTE) | 9849 | 9898 | 19,747 | 1.0049 | IDU (original) | 1250 | 6087 | 7337 | 4.8696 | IDU (SMOTE) | 6250 | 6300 | 12,550 | 1.008 |
|
|