Research Article

Fuzzy Aspect Based Opinion Classification System for Mining Tourist Reviews

Table 1

Critical evaluation of aspects extraction methods.

ReferenceExplicit aspectsImplicit aspectsCoreferential aspectsIrrelevant aspectsMethodAspects selectionResults
FrequentInfrequent

Marrese-Taylor et al., 2014 [19]HighNullNullNot handledHandledRules basedFrequent nouns30%
Marrese-Taylor et al., 2013 [18]HighNullNullNot handledHandledRules basedFrequent nounsNot given
de Albornoz et al., 2011 [17]HighNullNullHandledHandledRules basedRelative importance66.8%
Muangon et al., 2014 [16]HighNullNullNot handledHandledRules basedRankingNot given
Pekar and Ou, 2008 [15]HighNullNullNot handledHandledRules basedFrequent nounsNot given
Hai et al., 2014 [20]HighLowNullNot handledNot handledRules basedDomain-specific nouns65%
Colhon et al., 2014 [21]HighLowNullNot handledHandledSeeds basedGrammatical relationshipNot given
Mukherjee and Liu, 2012 [22]HighLowNullNot handledHandledSeeds basedHigher-order cooccurrences77%
Wang et al., 2010 [23]HighLowNullNot handledHandledSeeds basedMaximum term overlappingNot given
Zhu et al., 2011 [24]HighLowNullNot handledHandledSeeds wordsFrequency of cooccurrence69%
Wu and Ester, 2015 [25]HighMediumNullNot handledNot handledTopic model basedConnected topicsNot given
Xianghua et al., 2013 [26]HighMediumNullNot handledNot handledTopic model basedMinimum distance with topics73%
Xueke et al., 2013 [27]HighMediumNullNot handledNot handledTopic model basedFrequent topicsNot given
Proposed methodHighMediumHighHandledHandledFuzzy basedFURIA rules81%

We represent aspects into three types: frequent explicit aspects, infrequent explicit aspects, and implicit aspects shown in 2 to 4 columns of this table. Coreferential and irrelevant aspects handing shown in column 5 and column 6, respectively. We labeled these columns by null, low, medium, high, handled, and not handled.
“Null” = not extracting those types of aspects.
“Low” = extracting 10 to 40% of those types of aspects.
“Medium” = extracting 40 to 70% of those types of aspects.
“High” = extracting 70 to 100% of those types of aspects.
“Handled” = handling those types of aspects.
“Not handled” = not handling those types of aspects.