Research Article

An Efficient Multiscale Pyramid Attention Network for Face Detection in Surveillance Images

Table 1

The architecture of backbone.

Stage iOperatorResolutionChannelsLayers

1Conv3 × 3512 × 512451
2MBConv1256 × 256221
3MBConv6256 × 256244
4MBConv6128 × 128404
5MBConv6128 × 128805
6MBConv664 × 641125
7MBConv664 × 641927
8MBConv632 × 323202
9Conv1 × 1 and pooling and FC32 × 3212801

MBConv denotes mobile inverted convolutional bottleneck.