Research Article

An Improved Oversampling Algorithm Based on the Samples’ Selection Strategy for Classifying Imbalanced Data

Table 1

The information of the imbalanced data sets in the experiments.

Data set IDData sets#Abb#Attr#Min#MajIRData source

1BananaBanana27528080.03KEEL

2Haberman’s SurvivalHaberman3812250.36UCI

3BupaBupa61452000.73UCI

4AppendicitisAppendicitis721850.25KEEL

5Pima Indians DiabetesPima82685000.54KEEL

6German Credit DataGerman203007000.43UCI

7Vehicle SilhouettesVehicle181996470.31KEEL

8Led7digitLed7524480.12UCI

9WisconsinWisconsin92414580.53UCI

10WineWine13481300.37UCI