Research Article

Context-Fused Guidance for Image Captioning Using Sequence-Level Training

Table 2

Performance comparisons on MS COCO Karpathy test split under cross-entropy training.

Cross-entropy loss
MetricBLEU1BLEU2BLEU3BLEU4METEORROUGE-LCIDErSPICE

NIC [16]29.652.694.0
SCST [11]30.025.953.499.4
Up-down [4]77.236.227.056.4113.520.3
RFNet [17]76.460.446.635.827.456.8112.520.5
HAN [20]77.261.247.736.227.556.6114.820.6
RAtt-Soft [29]79.261.847.636.928.360.9114.320.8
CFG77.161.547.936.827.756.7114.020.8

The best results (%) are highlighted in boldface. The symbol “—” indicates the results are not reported.