Research Article
Focal CTC Loss for Chinese Optical Character Recognition on Unbalanced Datasets
Figure 3
Convolutional layers (ResNet) which are used to extract image feature sequences. The basic building block is residual learning unit, surrounded by the green dash box.