Research Article

Machine Learning Approach for Answer Detection in Discussion Forums: An Application of Big Data Analytics

Table 1

All twenty features with brief description.

CodeAbbreviationDescriptionFeature typeSubtype

F1ThrdCntrodRplyWMDistanceWord mover distance of a reply from the thread centreLexicalSemantic
F2ThrdCentrodRplyCosnSmlrtyCosine similarity of a reply with the thread centreLexicalPure lexical
F3TtlRplyCosnSmlrtyWholCrpsCosine similarity of a reply with the title based on corpus created from all threadsLexicalPure lexical
F4QustionRplyCosnSmlrtyWholCrpsCosine similarity of a reply with the initial post based on corpus created from all threadsLexicalPure lexical
F5TtlRplyCosnSmlrtyCosine similarity of a reply with the thread titleLexicalPure lexical
F6QustionRplyCosnSmlrtyCosine similarity of a reply with the thread initial postLexicalPure lexical
F7UnqWrdsNumber of unique words in a replyLexicalPure lexical
F8IsRplyByCrtrOfInitlPostWas the reply given by the creator of initial post?NonlexicalStructural
F9NumRepliesByUsrCurrentThrdTotal number of replies given by the user in the current threadNonlexicalStructural
F10NoThrdsUsrParticipatedTotal number of threads the user has participatedNonlexicalStructural
F11ReWrdsOvrlpInitialPostNumber of overlapping words between the initial post and the reply postLexicalPure lexical
F12ReWrdsOvrlpThrdTitlNumber of overlapping words between the thread title and the reply postLexicalPure lexical
F13IsRplyContan5WHWrdsDoes the reply contain 5WH words?NonlexicalContent based
F14IsRplyMntionOthrUsrNamesDoes the reply refer to any other forum user?NonlexicalStructural
F15IsRplyHvHyperlnkDoes the reply have any Hyperlink?NonlexicalContent based
F16WMDbtwnTitlRplWord mover distance between thread title and replyLexicalSemantic
F17WMDbtwnQustionRplWord mover distance between initial post and replyLexicalSemantic
F18TotlNoRpliesByUsrInAllThrdsTotal number of replies given by the user in all threadsNonlexicalStructural
F19TotlNoIntialPstsByUserTotal number of initial posts created by the userNonlexicalStructural
F20NoWrdsRplyTotal number of words present in a replyLexicalPure lexical