Research Article

Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts

Table 11

Obtained results (accuracy, %) for age and gender classification on the PAN author profiling 2016 Dutch training corpus under 10-fold cross-validation.

Feature set Gender
LR-NPLR-WPSVM-NPSVM-WP

D2V (1-gram) 74.7477.6071.0975.26
D2V (1 + 2-grams) 70.8375.7871.0975.52
D2V (1 + 2 + 3-grams) 73.4476.0470.3173.44
Character 3-grams 76.5672.6674.4872.92
Bag-of-Words 74.4871.8874.7470.83