Research Article

Context-Fused Guidance for Image Captioning Using Sequence-Level Training

Table 3

Performance comparisons on MS COCO Karpathy test split under CIDEr-D score optimization.

Sequence-level optimization
MetricBLEU1BLEU2BLEU3BLEU4METEORROUGE-LCIDErSPICE

NIC [16]31.954.3106.3
SCST [11]34.226.755.7114.0
Up-down [4]79.836.327.756.9120.121.4
RFNet [17]79.163.148.436.527.757.3121.921.2
HAN [20]80.964.649.837.627.858.1121.721.5
RAtt-soft [29]80.463.448.937.528.561.6122.122.1
CFG80.564.750.238.328.258.3125.421.6

The best results (%) are highlighted in boldface. The symbol “—” indicates the results are not reported.