Research Article

Hierarchical Self-Attention Hybrid Sparse Networks for Document Classification

Table 3

Sparse sentence encoder setting.

Sparse sentence encoder parameterSetting

Dropout rate0.1
RNN output size50
Activate functionReLU
Self-attention output size100
Kernel regularizerL2