Research Article

Object Detection Based on Swin Deformable Transformer-BiPAFPN-YOLOX

Table 4

Comparison of training results of Swin DeTr-BiPAFPN-YOLOX, Swin Tr-BiPAFPN-YOLOX, and other methods on the COCO 2017 test set. Swin Tr denotes Swin Transformer; Swin DeTr denotes Swin Deformable Transformer.

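| Method | Epochs | AP | AP50 | AP75 | APS | APM | APL | Param (M) | GFLOPs | Infer time (ms) | FPS |
|---|---|---|---|---|---|---|---|---|---|---|---|
| YOLOv4 [3] | 300 | 43.5 | 65.7 | 47.3 | 26.7 | 46.7 | 53.3 | 86.5 | 223.5 | 15.6 | 64.0 |
| YOLOv5 [4] | 300 | 44.5 | 63.1 | - | - | - | - | 21.4 | 51.4 | 11.1 | 90.1 |
| DETR [7] | 500 | 42.0 | 62.4 | 44.2 | 20.5 | 45.8 | 61.1 | 41.4 | 86.9 | 35.7 | 28.0 |
| DarkNet53-PAFPN-YOLOX [2] | 300 | 47.4 | 67.3 | 52.1 | 27.5 | 51.5 | 60.9 | 63.7 | 185.3 | 11.1 | 90.1 |
| Swin Tr-BiPAFPN-YOLOX | 350 | 48.4 | 67.8 | 52.6 | 29.3 | 52.6 | 61.8 | 79.4 | 211.8 | 13.7 | 73.0 |
| Swin DeTr-BiPAFPN-YOLOX | 156 | 49.7 | 68.6 | 53.2 | 31.2 | 54.9 | 63.3 | 44.5 | 110.7 | 10.1 | 98.7 |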