Research Article

Context-Fused Guidance for Image Captioning Using Sequence-Level Training

Figure 3

An illustration of the context gate, showing the scalar factor, the fused textual context s_t, and the word embedding vectors E(y).