Research Article

Multimodal Feature Learning for Video Captioning

Figure 8

Working time of the SeFLA model on each MSVD test video.