Research Article

A Novel Selective Ensemble Algorithm for Imbalanced Data Classification Based on Exploratory Undersampling

Algorithm 4

RotEasy algorithm.
(i)  Input: : the majority set, : the minority set, : the number of subsets undersampling from ,
    : the number of inner-layer ensemble, : validation dataset, : tradeoff parameter,
    : the set of class labels.
(ii)  For     to     do
  (a) Randomly undersampling a subset from , .
  (b) Learning the inner-layer ensemble :
    (1) Set , the weak classifier , initial weight
     distribution on the training set as .
    (2) for     to     do
    (3) Calculate the rotation matrix using , based on Algorithm 2.
    (4) Get the sampling subset , using weight distribution .
    (5) Learn by providing the transformed subset , as the input of classifier .
    (6) Calculate the training error over : .
    (7) Set the weight : .
    (8) Update over : ,
       where is the normalization constant: .
    (9) Endfor
(iii) Endfor
(iv)  Pruning: Apply the DREP method on the validation subset to prune the ensemble .
Denote the pruned ensemble members as , their corresponding normalized weights
and rotation matrices .
(v)  Classification Phase: For a given , calculate its class label as follows: