Research Article

A Supervised Approach to Predict the Hierarchical Structure of Conversation Threads for Comments

Table 7

The difference between the evaluation metrics in presence and absence of features.

FeatureDatasetEvaluation metric
AccCTD

SimilarityENENews−0.002330.07415−0.005380.03289−0.010950.089960.04684
Russianblog0.008740.000120.009630.003470.00695−0.010790.00355

Authors' languageENENews−0.013540.03246−0.017850.00693−0.027570.040520.00458
Russianblog−0.002290.01052−0.001790.00750−0.005900.00807−0.00182

Global similarityENENews−0.004020.05604−0.013380.01581−0.011290.070860.02742
Russianblog0.010900.014430.005820.008970.00368−0.00317−0.00087

Frequent wordsENENews0.002520.009310.001700.00424−0.001320.009730.00292
Russianblog0.000080.000230.000160.000290.000340.00047−0.00011

Length ratioENENews0.04206−0.029840.044500.017000.06421−0.05336−0.00850
Russianblog0.00651−0.003550.009200.000800.00784−0.005920.00235

Authors' nameENENews0.037530.063920.019200.032850.023410.059980.01788
Russianblog0.000000.000000.000000.000000.000000.000000.00000

Frequent patternENENews−0.020280.02930−0.023570.00300−0.037930.034100.00526
Russianblog0.004580.061000.008300.05206−0.005450.061500.00404

Location priorENENews0.08778−0.036900.085560.032800.11792−0.078520.02925
Russianblog0.160910.096710.177660.135150.166990.087930.02414

Candidate filtering rule 1ENENews−0.054310.06138−0.041020.01296−0.064580.075450.06949
Russianblog−0.125060.34370−0.139160.27445−0.126830.370030.32546

Candidate filtering rule 2ENENews0.00014−0.000040.000480.00041−0.00038−0.00035−0.00009
Russianblog0.025750.024750.024620.023410.017460.008020.00000

Candidate filtering rule 3ENENews0.00357−0.001210.004520.002540.00551−0.001610.00161
Russianblog0.08521−0.019780.096360.017020.09380−0.038250.07371