Research Article

Hierarchical Self-Attention Hybrid Sparse Networks for Document Classification

Table 2

Sparse word encoder settings.

Sparse word encoder parameter    Setting
Dropout rate                     0.1
RNN output size                  50
Activation function              ReLU
Self-attention output size       100
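
For readers reproducing the setup, the settings in Table 2 could be collected in a small configuration object. The following is only a minimal sketch: the class name `SparseWordEncoderConfig` and its field names are hypothetical and do not come from the article, and the table does not say whether the RNN output size of 50 is per direction or in total, so the sketch stores the value as given.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SparseWordEncoderConfig:
    """Hypothetical container for the Table 2 settings (names are assumptions)."""

    dropout_rate: float = 0.1               # Table 2: dropout rate
    rnn_output_size: int = 50               # Table 2: RNN output size
    activation: str = "relu"                # Table 2: activation function (ReLU)
    self_attention_output_size: int = 100   # Table 2: self-attention output size

    def __post_init__(self) -> None:
        # Basic sanity checks so that an invalid override fails early.
        if not 0.0 <= self.dropout_rate < 1.0:
            raise ValueError("dropout_rate must lie in [0, 1)")
        if self.rnn_output_size <= 0 or self.self_attention_output_size <= 0:
            raise ValueError("layer sizes must be positive integers")


# Defaults reproduce the values listed in Table 2.
config = SparseWordEncoderConfig()
```

Freezing the dataclass keeps the reported settings from being changed accidentally during an experiment; a variant can be created explicitly, for example `SparseWordEncoderConfig(dropout_rate=0.2)`.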