Research Article

A New Random Forest Algorithm Based on Learning Automata

Table 2

Details of textual data used for evaluation.

DomainName# Feature# Instance

TextStanford—Sentiment 140 corpus [106]Bag of word1600000
Large dataset of movie reviews [107]Bag of word50000
Sentence polarity dataset v1.0 [108]Bag of word10662
Internet movie database [105]Bag of word1400
Yelp review [105]Bag of word598000
Amazon review [105]Bag of word1000000
HealthcareHeart disease dataset [105]13200
Breast cancer dataset [105]30569
Arrhythmia dataset [105]279454
Parkinson dataset [105]45241
Caesarean section dataset [105]581
Gene expression dataset [105]255801
Diabetes dataset [105]7765
Statlog (heart) dataset [105]13271
PhysicalIonosphere dataset [105]34352
Sonar, mines vs. rocks dataset [105]60208
SoundVoice dataset [105]203168
Emotions from music dataset [105]28592