Research Article

Object Detection Based on Swin Deformable Transformer-BiPAFPN-YOLOX

Table 5

Experimental results of various scales networks on the COCO 2017 test set. Swin DeTr denotes Swin Deformable Transformer.

ModelsAP (%)AP50AP75APSAPMAPLParam (M)GFLOPsInfer time (ms)FPS

DarkNet53-PAFPN-YOLOX-S [2]39.664.647.522.748.454.19.026.89.8102.0
Swin DeTr-BiPAFPN-YOLOX-S44.767.750.325.950.959.67.116.38.2122.4
DarkNet53-PAFPN-YOLOX-M [2]46.465.450.626.351.059.925.373.812.381.3
Swin DeTr-BiPAFPN-YOLOX-M48.469.353.728.752.461.221.250.69.6104.3
DarkNet53-PAFPN-YOLOX-L [2]50.068.554.529.854.564.468.2195.614.569.0
Swin DeTr-BiPAFPN-YOLOX-L51.869.655.431.755.866.063.5181.710.793.7
DarkNet53-PAFPN-YOLOX-X [2]51.269.655.731.256.166.199.1286.917.357.8
Swin DeTr-BiPAFPN-YOLOX-X52.170.457.831.956.966.785.5225.414.171.0