Research Article
Imbalanced Data Set CSVM Classification Method Based on Cluster Boundary Sampling
Table 1
The basic information of four UCI data sets.
| Data set | Number of negative samples | Number of positive samples | Imbalance ratio | Data description |
| Shuttle | 57829 | 171 | 338 : 1 | High imbalance ratio High information amount | Abalone | 4145 | 32 | 130 : 1 | High imbalance ratio Low information amount | Yeast | 1433 | 51 | 28 : 1 | Low imbalance ratio Low information amount | Churn | 4293 | 707 | 6 : 1 | Low imbalance ratio High information amount |
|
|