Research Article

Movie Review Summarization Using Supervised Learning and Graph-Based Ranking Algorithm

Table 4

Movie review classification accuracy on three tasks.

FeaturesPL04Full IMDBSubjectivity

1Unigrams with NB81.586.6690.75
2Bigrams with NB77.788.2976.03
3Unigrams + bigrams with NB82.488.9191.22
4Unigram frequency + smoothed IDF + cosine normalization82.187.3690.7
5Bigram frequency + smoothed IDF + cosine normalization81.1588.3176.72
6Unigrams + bigrams + smoothed IDF + cosine normalization83.789.2890.91
10Benchmark model [62]88.9088.8988.13

PL04 refers to the collection of 2000 movie reviews often used as benchmark dataset for sentiment classification [61], Full IMDB dataset is a collection of 50,000 reviews, and sentence subjectivity dataset is a collection of 1000 movie reviews [61].