Research Article

Machine Learning Approach for Answer Detection in Discussion Forums: An Application of Big Data Analytics

Table 2

Fourteen lexical, content-based, and semantic features (including proposed semantic features F1, F16, and F17).

Code    Abbreviation

F1      ThrdCntrodRplyWMDistance
F2      ThrdCentrodRplyCosnSmlrty
F3      TtlRplyCosnSmlrtyWholCrps
F4      QustionRplyCosnSmlrtyWholCrps
F5      TtlRplyCosnSmlrty
F6      QustionRplyCosnSmlrty
F7      UnqWrds
F11     ReWrdsOvrlpInitialPost
F12     ReWrdsOvrlpThrdTitl
F13     IsRplyContan5WHWrds
F15     IsRplyHvHyperlnk
F16     “WMDbtwnTitlRpl”
F17     “WMDbtwnQustionRpl”
F20     NoWrdsRply
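
As an illustration of how the simpler lexical features in Table 2 might be computed, the following is a minimal sketch only. The table gives abbreviations, not definitions, so every reading here is an assumption: F7 is taken as the number of distinct words in the reply, F11 and F12 as counts of reply words shared with the initial post and the thread title, F13 and F15 as binary flags, and F20 as the reply's word count. The tokenizer, the 5W1H word list, the URL pattern, and the function names are hypothetical and are not taken from the article.

```python
import re

# Assumed 5W1H word list; the article's table does not enumerate it.
WH_WORDS = {"what", "when", "where", "who", "why", "how"}

# Assumed hyperlink pattern: anything starting with http(s):// or www.
URL_PATTERN = re.compile(r"https?://\S+|www\.\S+", re.IGNORECASE)


def tokenize(text):
    """Lowercase the text and return its alphanumeric word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def lexical_features(reply, initial_post, thread_title):
    """Return a dict of assumed readings of F7, F11, F12, F13, F15, F20."""
    reply_tokens = tokenize(reply)
    reply_vocab = set(reply_tokens)
    post_vocab = set(tokenize(initial_post))
    title_vocab = set(tokenize(thread_title))
    return {
        # Distinct words in the reply (assumed meaning of UnqWrds).
        "F7_UnqWrds": len(reply_vocab),
        # Distinct reply words that also occur in the initial post.
        "F11_ReWrdsOvrlpInitialPost": len(reply_vocab & post_vocab),
        # Distinct reply words that also occur in the thread title.
        "F12_ReWrdsOvrlpThrdTitl": len(reply_vocab & title_vocab),
        # 1 if the reply contains any 5W1H word, else 0.
        "F13_IsRplyContan5WHWrds": int(bool(reply_vocab & WH_WORDS)),
        # 1 if the reply contains something that looks like a URL, else 0.
        "F15_IsRplyHvHyperlnk": int(bool(URL_PATTERN.search(reply))),
        # Total number of word tokens in the reply.
        "F20_NoWrdsRply": len(reply_tokens),
    }


features = lexical_features(
    reply="Try the docs at https://example.com and restart the server",
    initial_post="How do I restart the server?",
    thread_title="Server restart problem",
)
```

The overlap features are returned as raw counts here; a normalized ratio would be an equally plausible reading of the abbreviations. The similarity and Word Mover's Distance features (F1 to F6, F16, F17) need a vector representation of the text and are outside the scope of this sketch.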