Mathematical Problems in Engineering

Research Article

A Novel Selective Ensemble Algorithm for Imbalanced Data Classification Based on Exploratory Undersampling

RotEasy algorithm.

(i) Input: : the majority set, : the minority set, : the number of subsets undersampling from ,
: the number of inner-layer ensemble, : validation dataset, : tradeoff parameter,
: the set of class labels.
(ii) For to do
(a) Randomly undersampling a subset from , .
(b) Learning the inner-layer ensemble :
(1) Set , the weak classifier , initial weight
distribution on the training set as .
(2) for to do
(3) Calculate the rotation matrix using , based on Algorithm 2.
(4) Get the sampling subset , using weight distribution .
(5) Learn by providing the transformed subset , as the input of classifier .
(6) Calculate the training error over : .
(7) Set the weight : .
(8) Update over : ,
where is the normalization constant: .
(9) Endfor
(iii) Endfor
(iv) Pruning: Apply the DREP method on the validation subset to prune the ensemble .
Denote the pruned ensemble members as , their corresponding normalized weights
and rotation matrices .
(v) Classification Phase: For a given , calculate its class label as follows: