Research Article

Focal CTC Loss for Chinese Optical Character Recognition on Unbalanced Datasets

Figure 3

Convolutional layers (ResNet) which are used to extract image feature sequences. The basic building block is residual learning unit, surrounded by the green dash box.