Research Article

An Evaluation of Deep Learning Methods for Small Object Detection

Table 3

Comparative results on small object dataset.

MethodBackboneClockFaucetJarMouseOutletPlateSwitchTel.t. boxt. papermAP

YOLO 416 [16]Darknet-1922.830.845220.413.1136.1035.319.39
YOLO 448 [16]2336.9952.518.413.617.54.2034.320.13
YOLO 480 [16]34.237.39.153.321.413.615.89.19.134.223.71
YOLO 512 [16]23.136.66.159.824.614.215.79.14.532.422.61
YOLO 554 [16]23.437.29.160.127.213.419.99.14.534.523.84
YOLO 640 [16]20.236.23.259.827.811.718.18.24.535.622.53
YOLO 800 [16]27.6362.360.232.813.123.39.19.126.724.02
YOLO 1024 [16]21.729.31.458.326.411.817.59.19.115.720.03
YOLO 320Darknet-5326.2238.384.5556.4636.4213.3424.810.654.5542.9625.83
YOLO 41628.4747.1510.8360.4943.1515.8730.7315.152.6248.330.28
YOLO 60829.9847.8910.7665.8848.0218.0931.2214.6217.9946.5633.1
YOLO 320ResNet-5019.5725.730.6745.1714.379.3813.849.099.0923.717.06
YOLO 41623.7836.650.454.2318.3713.7519.789.849.4235.6822.19
YOLO 60826.9240.651.7761.8629.1815.0420.2410.0913.2936.0125.5
YOLO 320ResNet-10120.5227.90.5744.6816.9813.0513.669.669.0924.3618.05
YOLO 41625.7235.63.0355.7322.415.6117.269.323.0338.7122.64
YOLO 60828.7944.599.4262.1833.3415.5323.8813.2415.8339.1728.6
YOLO 320ResNet-15221.6427.563.0348.0617.3911.1214.519.094.5531.8818.88
YOLO 41625.736.540.8953.8120.614.1320.2111.490.2933.0621.67
YOLO 60826.0144.544.556131.7613.0222.6712.359.9339.9926.58
SSD300 [16]ResNet-1015.59.1025.56.14.504.59.118.28.25
SSD300 [16]VGG169.117.1026.19.19.104.5016.79.16
SSD512 [16]VGG169.117.10439.19.19.19.107.611.32
RetinaNetResNet-50-FPN30.749.3265.521.316.18.512.9125.723.3
RetinaNetResNet-101-FPN30.648.77.164.72015.911.810.72.938.725.1
RetinaNetResNeXT-101-32  8d-FPN35.55512.166.523.918.49.816.29.453.730
RetinaNetResNeXT-101-64  4d-FPN31.450.28.966.320.815.39.4142.232.425.1
R-CNN [13]RPN prop. + VGG1631.931.34.256.831.19.314.216.423.429.424.8
R-CNN [13]Alexnet, 7, 300 pro32.427.25.156.9289.813.612.417.935.623.9
R-CNN [13]VGG16, 7, 300 pro37.330.37.260.641.515.821.513.72233.328.4
R-CNN [13]ContextNet (Alexnet, 7)32.726.84.656.426.39.912.912.218.73423.5
Fast RCNNResNet-50-C432.446.36.565.838.320.125.316.614.15231.7
Fast RCNNResNet-50-FPN37.447.37.368.946.72132.117.19.345.933.3
Fast RCNNResNet-101-FPN39.350.310.668.347.120.433.318.615.451.435.5
Fast RCNNResNeXT-101-32  8d-FPN47.554.810.371.85421.434.421.717.753.538.7
Fast RCNNResNeXT-101-64  4d-FPN45.455.710.972.553.32436.922.91658.139.6
Faster R-CNN [16]VGG1623.7637.658.035416.1611.8815.129.16.2537.2921.92
Faster RCNNResNet-50-C432.244.66.665.935.217.525.719.613.74030.1
Faster RCNNResNet-50-FPN35.749.97.368.448.918.829.614.711.453.333.8
Faster RCNNResNet-101-FPN39.849.24.968.24718.529.71412.952.233.7
Faster RCNNResNeXT-101-32  8d-FPN49.856.611.472.156.323.23720.818.858.740.5
Faster RCNNResNeXT-101-64  4d-FPN49.658.612.272.554.523.236.920.820.163.141.2

The values in bold represent the best in one-stage methods, and the ones in italics represent the highest in two-stage methods.