Review Article

Arabic Sentiment Analysis: A Systematic Literature Review

Table 5: Overview of the data extracted from related articles in ASA.

| Ref | SA task | Ap | Algorithm | SA level | DS/size | Domain | Language | Features | Accuracy (%) |
|---|---|---|---|---|---|---|---|---|---|
| [17] | AD | Un | Syntax-based approach | Aspect | 15000, 15000 | Hotel, products | MSA, DA | | 65.32 |
| [28] | BR&SC | Su | SVM | Aspect | 5K | Airline | MSA, DA | Skip-gram, FastText | 89 |
| [34] | BR&SC | Su | SVM | | 363 | | MSA, DA | | 76.09 |
| [25] | BR&SC | Su | KNN, NB, SVM | | 3015 | Food, sports, weather | MSA, DA | | 76.33 |
| [109] | SC | Su | Bagging, NB, KNN, DT | | D10, D11, D12 | | MSA, DA | SMOTE | 74.37; 74.9, 84.02 |
| [40] | BR&SC | LB | Corpus and LB approach | Word | 15,274 | 13 domains | MSA, DA, CA | TF-IDF, N-gram | 93 |
| [43] | BR&SC | Su | SVM | Sentence | D23, D05, D03, D17, D20 | | MSA, DA | TF-IDF, LSA, CBOW, SG | 83.02 |
| [110] | BR&SC | Su | DCNN | Sentence | 2390 | News | MSA | Word2vec, N-gram | 69.9 |
| [44] | BR&SC | Su | SVM, LGR | | 183531 | Products | MSA | BoW, TF-IDF | Bal: 77.76; Unbal: 91.21 |
| [16] | BR&SC | Su | NB, SVM, RFT | | 10254 | Elections | MSA, DA | N-gram, TF, TF-IDF | 77 |
| [49] | BR&SC | Su | SVM, KNN, BNB | | 5986 | | MSA, DA | N-gram, TF, TF-IDF | F-m = 88.8 |
| [45] | SC | Su | RFT, GNB, LGR, SGD | | D09, D05 | | MSA, DA | CBOW, SG | 87.10 |
| [53] | SC | Hb | Bagging, SVM, RFT, NB | | D10, D11 | | MSA, DA | Unigram, bigram | 90.4; 90 |
| [55] | SC | LB | SVM, ME, Bagging, Boosting, RFT, NNET, DT, NB | | D16, D03, lex = 5376 | | | | 95.98 |
| [111] | SC | Su | NB, NB-MLP | | 2154; 13420; 1353; 3962; 8522 | Attraction, hotel, movies, products, restaurants | MSA, DA | | 99.8; 85.1; 95.4; 97.3; 93.1 |
| [50] | BR&SC | Su | NB, SVM | | 48 | Stock market | MSA, DA | N-gram, TF-IDF, BTO | 83.58 |
| [51] | BR&SC | Hb | LB, LSTM, CNN, SVM, LGR, NB, DT, RFT | | D24, D05 | | MSA, DA | TF-IDF, CBOW | 87.5; 81 |
| [57] | Subj C | Su | DL | | 1100 | | MSA, DA | TF-IDF | 92.96 |
| [65] | SC | Su | NB, SVM | | D09 | Books | MSA, DA | TF-IDF, N-gram | 90.98 |
| [67] | BR&SC | Su | NB, SVM | | | News for multidomains | MSA, DA | Unigram, bigram, TF, TF-IDF | 84.56 |
| [112] | SC | Su | CNN-LSTM | Character, Ch5Gram, Word | D01, D03, D05 | Health | MSA | Trigram, ReLU | 95.68; 77.62; 94.24; 88.10 |
| [69] | BR&SC | Hb | SO-NB | Sentence | 1200 | | MSA | N-gram models, semantic features | 90 |
| [79] | BR&SC | Hb | LGR, PAG, SVM, PRN, RFT, ABT, LB | Review | D13 | Hotel | MSA, DA | Unigram, bigram, semantic features, BOW | 94 to 97 |
| [113] | Aspect SA | Su | NB, BYN, DT, KNN, SVM | Sentence | D19 | Hotel | MSA | Morphological, syntactic, semantic features | 95.4 |
| [56] | BR&SC | Su | NB, SVM, DT, RFT | | 1543347; 1462 | Politics | MSA | Syntactic, surface-form, sentiment features | 71.95 |
| [47] | BR&SC | Su | NB | Sentence | 18278 | Elections | MSA, DA | Unigram, bigram, IG | 93.13 |
| [59] | BR&SC | Su | SVM, NB | Document | D21 | News, politics, sports, culture | MSA, DA | Unigram, bigram | 90.20 |
| [46] | BR&SC | Hb | SO, SVM, NB | Sentence | 1520 | | DA | N-gram, CountVectorizer | 92.98 |
| [114] | SC | Su | CNN | Sentence | D05, D20 | | MSA, DA | Glove, SG, CBOW, random word vectors | 72.1463 |
| [60] | BR&SC | Su | SVM, NB, KNN | Sentence | 996 | Multidomain | MSA, DA | Unigram, bigram, BTO, TF-IDF | 78 |
| [73] | SC | Su | DT | Document | D16, D06 | Movies, hotel | MSA, DA | | 93.83; 90.63 |
| [80] | Subj C&SC | Hb | DT, NB, KNN, Ontology Baseline, LB | Aspect | D09, 2000 | Books, Hotel | MSA | Domain features | f-m = 79.18; 78.83 |
| [81] | BR&SC | Hb | SVM, NB, KNN | Sentence | 3476 | Movies, economy, sports, history, politics | MSA, DA | | 97.44 |
| [61] | BR&SC | Su | SVM, KNN, NB, DT, LEM2 | Document | 4812 | | DA | Rough set method | 74 |
| [66] | BR&SC | Su | RNTN | Phrase, sentence | 1177 | | MSA | Orthographic, morphological features | 80 |
| [35] | SC | Su | SVM, NB, MNB, SGD, DT | Review | D09 | Books | MSA, DA | Unigram, bigram, TF-IDF, genetic algorithm | 94 |
| [32] | BR&SC | Un | LB | Document | 1000, 1000 | News, arts | DA | | 73–96 |
| [15] | BR&SC | Su | SVM | Sentence, document | D19 | Hotel | MSA | Unigram | 76.42 |
| [83] | BR&SC | Hb | KNN, SVM, LLR, NB, NEUNET | Document | 8861 | 15 domains | DA | Polarity scores, dialects, synsets, inflected forms | 97.8 |
| [84] | Aspect SA | Un | LB | Review | 200 | | MSA | Sentiment features | 92.15 |
| [86] | BR&SC | Hb | SVM, NB, LB | Sentence | 64342 | | DA | N-gram | 70 |
| [115] | SC | Su | Stacking, SGD, RFT, LGR, GNB | Sentence | 1350 | | DA | Word2vec, SMOTE | 85.28 |
| [54] | SC | Su | SVM, NB | Sentence | 9096 | | DA | BOW, bigram, trigram | f-m = 73 |
| [68] | BR&SC | Su | SVM, NB | Sentence | 1800 | | MSA, DA | N-gram models, TF-IDF | 88.72 |
| [88] | SC | Su | MLP, LGR | Sentence | D09 | | MSA, DA | Doc2vec | 32.38 |
| [90] | SC | Su | CNN, LSTM, RNN | Document | D05, D03 | | DA | CBOW, SG, mul, CONC | 81.63; 87.27 |
| [82] | SC | Su | SVM, DT, NB, KNN, HCs | Document | D09 | | MSA, DA | BOW, correlation analysis | 72.64 |
| [48] | BR&SC | Su | DT, RFT, SVM | Sentence | 10254 | Politics | MSA, DA | N-gram models, TF-IDF, TF | 81 |
| [91] | SC | Su | DNN (LSTM), RNN | Sentence | D09 | | MSA, DA | Word embedding, BOW | 71 |
| [14] | SC | Su | RNN, SVM | Sentence | D19 | | MSA | word2vec, lexical, morphological, semantic features, N-gram | 95.4 |
| [116] | Subj C&SC | Su | SVM | Sentence | D04 | | MSA | | 69.37 |
| [104] | BR&SC | Su | SVM, NB | Sentence | 1121 | Education | MSA | N-gram | 84.62 |
| [71] | SC | Su | SVM, NB | Review | D16 | Movies | MSA | N-gram models, TF-IDF, BTO, TF | 96.67 |
| [29] | SC | Su | SVM, LGD | Review | Unbal, bal (D09) | Books | MSA, DA | BOW, N-gram | 88.51, 78.14 |
| [30] | BR&SC | Su | LGR, PAG, SVM, PRN | Review | Unbal, bal (D07) | Books | MSA, DA | Unigram, bigram | 0.744–0.911; 0.847–0.85 |
| [13] | BR&SC | Su | SVM, NB | Document | 1331 | Products | MSA, DA | N-gram, BTO, TF-IDF | 89.68 |
| [89] | SC | Su | SVM, NB, KNN, ANN | Sentence | 500 | Hotel | MSA | | f-m = 92 |
| [36] | BR&SC, Subj C | Su | SVM, NB | Sentence | 3700 | | MSA, DA | N-gram models | 89.55 |
| [58] | SC | Su | AR | | D16 | Movies | MSA, DA | IG, chi-square, GI | 86.81 |
| [93] | BR&SC | Su | Lexical semantic, CRF | Sentence | 381 | News | MSA | Position, bigram, trigram, morphological | f-m = 84.93 |
| [94] | BR&SC | Su | SVM, NB, KNN | Document | 250 | Products | MSA, DA | N-gram models | 94 |
| [95] | BR&SC | Su | SVM, MNB | Sentence | 134194 | | MSA, DA | Automatic labeling | 75.7 |
| [117] | SC | Su | Rule-based approach | Sentence | D16, 2000 | Movies, arts, politics | MSA, DA | | 85.6, 93.9 |
| [97] | BR&SC, Subj C | Su | SVM, NB, DT, RFT | Sentence | | Politics | MSA | N-gram, TF-IDF | P = 70.97 |
| [98] | SC | Su | SVM, KNN, NB, DT, RFT | Sentence | | Terrorism | MSA | Lexical, surface-form, N-gram | P = 71.76 |
| [99] | SC | Su | SVM | Aspect | Restaurant reviews, 1000, D16 | Novels, products, movies, sports, hotels, restaurants | MSA | N-gram | 85.35; 90.60; 96.00 |
| [100] | BR&SC | Su | SVM, BPNN, NB, DT | Sentence | 2000 | | MSA, DA | | 96.06 |
| [62] | BR&SC | Su | SVM, NB | Sentence | 18278 | Politics | DA | N-gram, TF-IDF | |
| [118] | SC | Su | SVM, CNB, MNB | Sentence | D10, D11, D12 | | MSA | N-gram, TF, IDF, TF-IDF, IG | 77.34 |
| [37] | BR&SC | Hb | SO-SVM-NB | Sentence | 4800 | News | DA | N-gram models | 80.9 |
| [92] | SC | Su | SVM, NB, KNN | Sentence | 3073 | Politics, arts | MSA, DA | N-gram | 68.69 |
| [74] | BR&SC | Su | SVM, NB, KNN | Sentence | 2,591 | Education, sports, politics | MSA | BTO, TF-IDF, TF | 69.97 |
| [63] | BR&SC | LB | LB | Sentence, document | D09, D14 | Health, book | MSA, DA | | 71 |
| [38] | BR&SC | Su | NB, SVM, ME | Document | 28576 | | DA | BoW, N-gram models | 86.75 |
| [41] | SC | Su | SVM, NB, DT | Sentence | 2000 | Politics, arts | MSA | | P = 80.9 |
| [101] | BR&SC | Hb | LB, NB | Sentence | 2590 | Restaurants | DA | Sentiment words | 90.54 |
| [102] | BR&SC | Su | NB, DT | Sentence | 2000 | | MSA | | 64.85 |
| [33] | BR&SC, Subj C | Su | RFT, GNB, SVM, LGR, SGD | Word | D05, D03, D17, D09 | Quran, customers | MSA, DA | CBOW | 80.21–81.69 |
| [119] | SC | LB | Rule-based, LB | | D16 | | MSA | | 89.6 |
| [18] | BR&SC | Su | CRF, DT, NB, KNN | Sentence | 2265 | News | MSA | NER, N-gram | 86.5 |
| [42] | BR&SC | Hb | SO, SVM | Sentence | 1103 | | MSA, DA | N-gram models | 84.01 |
| [103] | BR&SC | LB | LB | Word | D25, D05, D18, Lex: D25-PMI | | DA | | 89.58 |
| [87] | BR&SC | Su | SVM | Sentence | 625 | Multidomain | MSA, DA | | 83.5 |
| [96] | BR&SC | Su | Bagging, boosting | Sentence | 1500 | Sports, news, Economics | MSA, DA | | 85.95 |
| [72] | BR&SC | Su | SVM, NB | Document | D02 | | MSA, DA | | M-P = 90.5 |
| [31] | BR&SC | Su | SVM-KNN | Review | 625 | Hotel | DA | | 97 |
| [105] | SC | Su | DT, Dtable, SVM, MNB, voting (KNN, DT, and NB) | Sentence | Unbal, bal: D09 | | MSA | BOW | 42.7–46.4 |
| [120] | BR&SC, Subj C | Su | SVM, MNB, BNB, PAG, SGD, LGR, PRN, KNN | Sentence | D05 | | DA | N-gram models | 69.1 |
| [121] | BR&SC | Su | Voting (RSS and SVM) | Sentence | 800 | News | MSA, DA | | 98 |
| [64] | SC | Su | Voting (ME, SVM, and ANN), bagging, boosting, stacking | Document | D16, D11-D13 | | MSA | Stylistic, morpholexical, trigram, bigram, TF-IDF, TF, BTO | F1 = 85.06 |
| [122] | BR&SC | Ss | SVM, LGR, BNB, KNN, SGD | Document | D09 | Movies, hotels, restaurants, products | MSA | Unigram, bigram, TF-IDF, word count | 82.4 |
| [106] | BR&SC | Su | SVM, NB, KNN | Sentence | 500 | | MSA | | f-m = 91.5 |
| [70] | BR&SC | Su | NB, KNN | Document | 2591 | Education, sports, politics | MSA, DA | Bigram | M-P = 75.25 |
| [26] | BR&SC | Su | SVM, NB, KNN | Sentence | D16, 164 | Politics | MSA, DA | Correlation, N-gram models | 96.6 |
| [107] | BR&SC | Su | NB, KNN | Sentence | 300 | | MSA | | 63.79 |
| [75] | BR&SC | Su | SVM, NB | Sentence | 300, 250 | | DA | N-gram models | 0.75.64 |
| [123] | SC | Su | SVM, NB | Document | 5070 | News, entertainment, sports, science, business | MSA | Chi-square, correlation, GSS coefficient, IG, relief F | M-F = 95.1 |
| [76] | BR&SC | Su | NB, voting | Document | 4812 | | DA | Rough set method, genetic | 57 |
| [52] | BR&SC | Su | SVM, NB | Document | D22, D16, 7400 | | MSA | BTO, TF, TF-IDF, score | 94.88–97.81 |
| [124] | BR&SC | Ss | Pattern matching, majority with entities | Sentence | 5000 | Restaurants | DA | | 60.5 |
| [24] | Subj C and SC | Su | SVM, NB | | Obama, Messi, iPhone, shia | Politics, social, products | MSA | N-gram, IDF, TF-IDF | 68.05; 87.43 |
| [85] | BR&SC | Su | SVM, MNB | Sentence | 260 | | DA | N-gram | 72.78 |
| [39] | BR&SC | Su | SVM | Sentence | 1350 | News | DA | | P = 88.63 |
| [77] | SC | Hb | Voting (SVM, NB, and SO) | Sentence | D16, D22 | | MSA | N-gram models | F1 = 90.74 |
| [78] | SC | Su | NB, SVM, KNN | Document | D16, D10, D11 | | MSA | TF-IDF, TF, IDF, N-gram | 93.0, 78.2 |
| [125] | SC | Su | MNB, BNB, SVM | Sentence | Unbal, bal: D09 | | MSA, DA | | 91.0, 82.7 |
| [27] | BR&SC | Su | SVM, NB | Sentence | Unbal, bal: 4625 | Arts, politics, science, technology, social | MSA, DA | | 68.2, 61.4 |
| [12] | BR&SC | Su | KNN, NB, DT, SVM | Sentence | 1000 | | MSA | Social mention, Senti. strength | 95.59, 93.29 |