Research Article

Effective and Efficient Android Malware Detection and Category Classification Using the Enhanced KronoDroid Dataset

Table 7

Related work on malware detection and category classification using different datasets.

SourceApproach and yearName/Year of datasetTime frameFeatures and approachDynamicMalware detectionCategory classification

[42]Semisupervised learning (2018)2016N/AHybrid feature selection: aggregate informationEmulatorAccuracy: 91.23%N/A (dynamic accuracy: 80.3%)
[38]Deep artificial Neural network (2021)CICANDMAL2019N/AHybrid feature selection: N/AReal deviceStatic accuracy: 93.40%4 categories (static accuracy: 92.5%) (dynamic accuracy: 80.3%)
[43]Machine learning (2018)Updroid2014–2018Hybrid feature selection: N/AEmulatorDetection as categorizationAccuracy: 96.37%
[44]Deep learning (2020)DL-driod dataset 2019N/AHybrid Feature ranking: InfoGainReal deviceAccuracy: 98.5%N/A
[18]Deep neural network with pseudolabel (2020)CICMALDroid20202017-2018Dynamic feature selection: N/AEmulatorDetection as categorization4 categories (F1-score: 97.8%)
[7]Machine learning (2021)CCS-CICAndMal2020N/ADynamic feature selection: N/AEmulatorDetection as categorization12 categories (precision: 98.4%)
[45]Machine learning (2017)Drebin,Mcafee PraguardN/AStatic feature selection: mean decrease impurity (MDI)N/A99.82%99.26% into families
[46]Machine learning (2018)Drebin, Genome VirusShare2009–2017Dynamic feature selection: N/AEmulator97.4%97.8% into families
[47]Machine learning (2020)VirusShare AndroZoo2010–2017Dynamic feature importance: Top100EmulatorF1: 92.88% (same year)
F1: 71.81% (across year)
N/A
[48]Machine learning (2021)APKPure, random datasetN/AStatic, feature selection: LR basedN/AAccuracy: 96.3%N/A
[49]Machine learning (2021)APKPure, VirusShareN/AStatic, feature selection: filter-basedN/AF-measure: 95%N/A
[50]Machine learning (2022)Mendeley repositoryN/ADynamic, features selection: embedded BFEEmulatorF-measure: 99%N/A
[51]Machine learning (2022)Multiple repositoriesN/ADynamic, feature selection: rough set analysis (RSA) and principal component analysis (PCA)EmulatorDetection rate: 98.8%N/A
[52]Machine learning (2023)AndroZoo and DrebinN/AStatic, feature selection: wrapper based (DDQN)N/ADetection rate: 95.6%N/A
[23]Deep neural network with pseudo label stack auto encoder (2022)CICMALDroid20202017-2018Hybrid feature selection: N/AEmulatorAccuracy: 98.28%5 categories (precision: 98.4%)
[53]Ensemble (random forest)KronoDroid2008–2020Hybrid, features selection: Chi-squaredReal and emulated deviceAccuracy: 95%
Precision: 95%
N/A
OurMachine learning (random forest)Subset of 2020 KronoDroid2008–2020Hybrid, filter-based features selectionReal deviceAccuracy: 98.03%15 categories (accuracy: 87.6%)

The values in bold emphasizes aspects of this research that are important in comparison to existing research.