Research Article

Hierarchical Self-Attention Hybrid Sparse Networks for Document Classification

Table 4

Results in variants models.

ArchitectureModelsYelp 2018IMDB

BaselineHSAHN + GRU73.4891.75
HSAHN + LSTM73.2388.21

Sparse GRUsHSAHSN + GRU173.2293.63
HSAHSN + GRU273.2893.12
HSAHSN + GRU373.2295.69

Sparse LSTMsHSAHSN + LSTM173.0092.18
HSAHSN + LSTM272.7591.23
HSAHSN + LSTM372.9491.25