Research Article

Cost-Sensitive Support Vector Machine Using Randomized Dual Coordinate Descent Method for Big Class-Imbalanced Data Classification

Table 1

Specification of the benchmark datasets. Number of ex. is the number of example data points. Number of feat. is the number of features. Ratio is the class imbalance ratio. Target specifies the target or positive class. Number is the number of the datasets.

DatasetNumber of ex.Number of feat.RatioPositive classNumber

KDD99 (intrusion detection)5,209,460421 : 4Normal1
Web span350,0002541 : 2−12
MNIST70,0007801 : 1053
Covertype581,012541 : 211 Cottonwood/Willow(4)4
SIAM128,59630,4381 : 20001, 6, 7, 115
SIAM1128,59630,4381 : 71611, 126