Research Article

Fast Vehicle and Pedestrian Detection Using Improved Mask R-CNN

Table 5

Experimental data.

Backbone_classFPN + resnet101_81FPN + resnet86_81FPN + resnet50_81FPN + resnet101_3FPN + resnet86_3FPN + resnet50_3

59.1158.7446.1675.6974.8467.09
55.0254.8842.0470.2469.2362.15
47.4547.8835.1360.3458.7751.98
33.3533.8822.8539.9639.0431.59
76.7675.9670.6080.1078.8973.21
53.2950.8745.7752.7351.3846.99
83.1581.2675.8684.0983.2876.71
Min_train_loss0.95640.90921.2870.65920.71380.8643

The evaluation standard uses the mAP. The number in the upper right corner of mAP indicates the number of input/output units (IOUs). The words “person,” “car,” and “bus” in the bottom right corner mean the detection of the single category. Min_train_loss refers to the loss value of each model after training.