Research Article

Multiscale Feature Learning Based on Enhanced Feature Pyramid for Vehicle Detection

Table 1

Detection results of the proposed method and other methods on the KITTI testing set. The inference time is evaluated on single Nvidia RTX 3070 GPU per image.

MethodAverage precision (%)Inference speed (s)
EasyModerateHard

Faster R-CNN [1]86.7181.8471.120.84
YOLOv2 [3]76.7961.3150.250.01
Faster R-CNN with FPN backbone [4]88.6284.1473.200.92
MS-CNN [37]90.0389.0276.110.18
Improving faster R-CNN [26]89.2087.8674.720.06
SINet [25]89.6090.6077.750.12
Multitask CNN [20]91.2891.6785.43ā€”
Proposed method93.3592.1881.320.13