Research Article

Fast Vehicle and Pedestrian Detection Using Improved Mask R-CNN

Table 3

Comparison of the network structures of Resnet-50, Resnet-86, and Resnet-101.

Backbone    Output size    Resnet-50    Resnet-86    Resnet-101
Conv_1      512 ∗ 512      7 ∗ 7, 64, stride 2
Conv_2      256 ∗ 256      3 ∗ 3 max pool, stride 2
Conv_3      128 ∗ 128
Conv_4      64 ∗ 64
Conv_5      32 ∗ 32
            1 ∗ 1          Average pool, 1000-d fc, softmax
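The output sizes in Table 3 are consistent with each stage halving the spatial resolution. The following minimal sketch reproduces that column under two assumptions that the table does not state: a 1024 ∗ 1024 input image (inferred from the 512 ∗ 512 output of the stride-2 Conv_1) and an overall stride of 2 in each of Conv_3 to Conv_5. The names `stage_output_sizes` and `STAGE_STRIDES` are illustrative and do not come from the paper.

```python
# Sketch: feature-map side length after each backbone stage.
# Assumptions (not stated in the table): square 1024-pixel input,
# padding that keeps sizes evenly divisible, and an overall stride
# of 2 in each of Conv_3, Conv_4, and Conv_5.

STAGE_STRIDES = [
    ("Conv_1", 2),  # 7 * 7 convolution, 64 filters, stride 2
    ("Conv_2", 2),  # 3 * 3 max pool, stride 2
    ("Conv_3", 2),  # assumed downsampling by 2
    ("Conv_4", 2),  # assumed downsampling by 2
    ("Conv_5", 2),  # assumed downsampling by 2
]


def stage_output_sizes(input_size, stage_strides):
    """Return a dict mapping stage name to output side length."""
    sizes = {}
    size = input_size
    for name, stride in stage_strides:
        size //= stride  # each stage divides the side length by its stride
        sizes[name] = size
    return sizes


sizes = stage_output_sizes(1024, STAGE_STRIDES)
for name, side in sizes.items():
    print(f"{name}: {side} * {side}")
```

With these assumptions the computed sizes match the Output size column for Conv_1 through Conv_5. The final 1 ∗ 1 row corresponds to the average pooling layer, which collapses the remaining spatial extent before the fully connected layer.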