Research Article

Object Detection Based on Swin Deformable Transformer-BiPAFPN-YOLOX

Table 1

Results of ablation experiments using different components on the COCO 2017 validation set. Swin DeTr denotes Swin Deformable Transformer.

Method                         AP    AP50  AP75  APS   APM   APL   Params (M)  GFLOPs  Inference time (ms)  FPS
DarkNet53-PAFPN-YOLOX [2]      47.4  67.3  52.1  27.5  51.5  60.9  63.7        185.3   11.1                 90.1
Swin transformer-PAFPN-YOLOX   48.4  67.8  52.6  29.3  52.6  61.8  86.9        221.6   14.5                 69.0
Swin DeTr-PAFPN [26]-YOLOX     49.1  68.3  53.0  30.7  54.6  62.9  53.2        173.1   12.4                 80.6
Swin DeTr-BiPAFPN-YOLOX        49.7  68.6  53.2  31.2  54.9  63.3  44.5        110.7   10.1                 98.7
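
The last two columns of Table 1 appear to be related by FPS ≈ 1000 / inference time (ms). The following is a minimal sketch of that arithmetic, assuming the inference time is a per-image latency at batch size 1, which the table does not state. The dictionary `TABLE_1` and the helper `fps_from_latency` are illustrative names, not part of the original work; the numbers are copied from the table.

```python
# Values copied from Table 1: (inference time in ms, reported FPS).
TABLE_1 = {
    "DarkNet53-PAFPN-YOLOX": (11.1, 90.1),
    "Swin transformer-PAFPN-YOLOX": (14.5, 69.0),
    "Swin DeTr-PAFPN-YOLOX": (12.4, 80.6),
    "Swin DeTr-BiPAFPN-YOLOX": (10.1, 98.7),
}


def fps_from_latency(latency_ms: float) -> float:
    """Frames per second implied by a per-image latency in milliseconds.

    Assumption: one image is processed per forward pass.
    """
    return 1000.0 / latency_ms


# Compare the implied throughput with the FPS reported in the table.
for name, (latency_ms, reported_fps) in TABLE_1.items():
    implied_fps = fps_from_latency(latency_ms)
    print(f"{name}: implied {implied_fps:.1f} FPS, reported {reported_fps} FPS")
```

The implied and reported values agree to within about 0.3 FPS for every row. The largest gap is in the last row, where 10.1 ms implies roughly 99.0 FPS against a reported 98.7, which is consistent with the latency having been rounded to one decimal place.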