Research Article

Multimodal Feature Learning for Video Captioning

Figure 3. Overall framework of the proposed video captioning model.