Research Article

SENetCount: An Optimized Encoder-Decoder Architecture with Squeeze-and-Excitation for Crowd Counting

Table 5

Ablation experiments.

NetworksPart_APart_BUCF_QNRFMall
MAERMSEMAERMSEMAERMSEMAERMSE

SE-ResNetCount5076.0120.57.712.2111.1194.11.221.58
SE-ResNeXtCount5071.9118.78.013.3104.9182.61.201.52
SE-ResNeXtCount50+SSIM71.7118.08.213.8103.6181.51.191.53
SE-ResNeXtCount50+MS-SSIM71.8117.08.013.2104.8182.91.191.52
SE-ResNeXtCount10171.8115.47.412.6107.9203.31.201.57
SE-ResNeXtCount101+SSIM71.0115.47.512.5108.0206.41.191.55
SE-ResNeXtCount101+MS-SSIM71.0115.07.312.1107.7201.11.151.49

SE-ResNetCount50 and SE-ResNeXtCount50/101, respectively, choose SE-ResNet or SE-ResNeXt as the backbone network and adopt the DASPP module and FFM module given in Figure 1. +SSIM and +MS-SSIM indicate that the objective loss function combines the Euclidean loss with the SSIM or MS-SSIM index.