Research Article

Android Malware Characterization Using Metadata and Machine Learning Techniques

Table 4

Full benchmark test with top-15 predictors.

Malware NumDetectors F1-score Precision Recall

Logistic Regression (train/test)

2%10.82/0.10.76/0.060.89/0.24
25%10.89/0.570.87/0.460.91/0.73
50%10.93/0.790.94/0.820.93/0.77

2%20.8/0.180.74/0.120.88/0.33
25%20.9/0.680.89/0.590.9/0.79
50%20.94/0.820.95/0.780.93/0.86

2%40.81/0.270.75/0.190.89/0.47
25%40.91/0.730.9/0.650.91/0.83
50%40.95/0.840.97/0.790.94/0.89

Support Vector Machine (train/test)

2%10.85/0.080.76/0.050.96/0.23
25%10.93/0.680.92/0.690.93/0.67
50%10.96/0.820.96/0.870.95/0.77

2%20.82/0.160.72/0.10.95/0.35
25%20.93/0.730.93/0.70.93/0.76
50%20.96/0.840.97/0.90.94/0.8

2%40.81/0.260.7/0.170.97/0.54
25%40.94/0.770.94/0.720.93/0.83
50%40.96/0.870.98/0.890.95/0.84

Random Forest (train/test)

2%10.99/0.120.99/0.070.99/0.33
25%10.99/0.730.99/0.70.99/0.77
50%10.99/0.840.99/0.880.99/0.8

2%20.99/0.220.99/0.150.99/0.46
25%20.99/0.770.99/0.730.99/0.83
50%20.99/0.870.99/0.890.99/0.86

2%40.99/0.320.99/0.220.99/0.59
25%40.99/0.810.99/0.760.99/0.87
50%40.99/0.890.99/0.880.99/0.9