Research Article

Machine Learning Approach for Answer Detection in Discussion Forums: An Application of Big Data Analytics

Table 3

Top 11 features selected by the chi-square technique for Ubuntu dataset.

CodeAbbreviation

F1ThrdCntrodRplyWMDistance
F2ThrdCentrodRplyCosnSmlrty
F7UnqWrds
F8IsRplyByCrtrOfInitlPost
F9NumRepliesByUsrCurrentThrd
F13IsRplyContan5WHWrds
F15IsRplyHvHyperlnk
F16“WMDbtwnTitlRpl”
F17“WMDbtwnQustionRpl”
F19TotlNoIntialPstsByUser
F20NoWrdsRply