Research Article

SENetCount: An Optimized Encoder-Decoder Architecture with Squeeze-and-Excitation for Crowd Counting

Table 2

ResNet, SE-ResNet, and SE-ResNeXt backbone networks.

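| Operation | Output size | ResNet50 | SE-ResNet50 | SE-ResNeXt50 | SE-ResNeXt101 |
| --- | --- | --- | --- | --- | --- |
| Image | 1 | | | | |
| Conv2d | 1/2 | 64, stride 2 | 64, stride 2 | 64, stride 2 | 64, stride 2 |
| Max pooling | 1/4 | stride 2 | stride 2 | stride 2 | stride 2 |
| Bottleneck1 | 1/4 | | | | |
| Bottleneck2 | 1/8 | | | | |
| Bottleneck3 | 1/16 | | | | |
| Bottleneck4 | 1/32 | | | | |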

ResNet50 (left), SE-ResNet50 (middle), and SE-ResNeXt50/101 (right). The shapes and operations of each residual building block, with their specific parameter settings, are listed inside the brackets. The number of parallel or cascaded blocks in a stage is given outside the brackets. The inner brackets give the output dimensions of the two fully-connected layers in an SE block. Grouped convolution layers, where marked, use 32 groups.
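The output-size column and the footnote can be illustrated with a little arithmetic. The sketch below is not the authors' implementation. It assumes that every stride-2 layer is padded so that sizes round up, and that the SE reduction ratio is 16 (the value commonly used for SE networks, not stated in this table). The function names and the channel widths in the usage note (256 and 128) are hypothetical values chosen for illustration.

```python
import math

# Downsampling factor of each stage, taken from the "Output size" column
# of Table 2 (1, 1/2, 1/4, 1/4, 1/8, 1/16, 1/32).
STAGE_SCALES = {
    "Image": 1,
    "Conv2d": 2,
    "Max pooling": 4,
    "Bottleneck1": 4,
    "Bottleneck2": 8,
    "Bottleneck3": 16,
    "Bottleneck4": 32,
}


def stage_sizes(height, width):
    """Return the (H, W) feature-map size after each stage.

    Assumption: each stride-2 layer rounds odd sizes up, so the size
    after a stage is ceil(input / scale).
    """
    return {
        name: (math.ceil(height / scale), math.ceil(width / scale))
        for name, scale in STAGE_SCALES.items()
    }


def se_fc_dims(channels, reduction=16):
    """Output dimensions of the two fully-connected layers in an SE block.

    The first layer squeezes `channels` down by `reduction` (assumed 16);
    the second restores the original channel count.
    """
    return [channels // reduction, channels]


def channels_per_group(width, groups=32):
    """Channels handled by each group of a grouped convolution (32 groups)."""
    if width % groups != 0:
        raise ValueError("width must be divisible by the number of groups")
    return width // groups
```

Under these assumptions, `stage_sizes(768, 1024)["Bottleneck4"]` gives `(24, 32)`, `se_fc_dims(256)` gives `[16, 256]`, and `channels_per_group(128)` gives `4`.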