Research Article

Handling Imbalance Classification Virtual Screening Big Data Using Machine Learning Algorithms

Table 2

Complete set of experiments and average G-mean results.

AlgorithmPaDEL numeric descriptorPaDEL fingerprint
No-sampleSMOTEKSMOTETimeNo-sampleSMOTEKSMOTETime

AID 440RF0.1670.5650.954230.290.4420.9612
DT0.50.590.9379.30.510.4590.9584.9
MLP0.60.50.963200.4770.4980.9649.6
LG0.560.670.963110.4130.5120.965.6
GBT0.230.560.963330.4770.4210.96317.1

AID624202RF0.4450.6250.95229.70.50.6280.9615.3
DT0.5760.6140.94100.540.5640.945
MLP0.740.7150.9525.20.6360.4970.95813.5
LG0.6280.830.9426.80.7910.780.83713.25
GBT0.4890.610.95450.4950.4820.95422.36

AID 651820RF0.7220.7920.956410.7410.7980.9219.25
DT0.7250.720.9328.780.7650.7430.894.44
MLP0.820.8170.915350.7880.80.9117.3
LG0.7790.83570.962190.750.7680.899.36
GBT0.7140.7420.960.50.7620.7660.90529.9