Research Article

A Chinese Lip-Reading System Based on Convolutional Block Attention Module

Table 1

The accuracy of each model.

Model                                 Params (M)   Top-1 accuracy (%)

Vgg16 + LSTM + Attention              144.4        95.2
Vgg16 + GRU + Attention               142.4        95.3
InceptionV3 + LSTM + Attention        31.9         98.2
InceptionV3 + GRU + Attention         29.9         99.1
ResNet50 + LSTM + Attention           33.6         98.2
ResNet50 + GRU + Attention            31.6         99.3
ResNet101 + LSTM + Attention          52.6         97.3
ResNet101 + GRU + Attention           50.6         99.6
ResNet152 + LSTM + Attention          68.3         98.4
ResNet152 + GRU + Attention           66.3         99.8
ResNet50 + CBAM + LSTM + Attention    36.1         98.7
ResNet50 + CBAM + GRU + Attention     34.1         99.6