Research Article

Deep Visual Semantic Embedding with Text Data Augmentation and Word Embedding Initialization

Table 5

Experimental results with text data augmentation on MS-COCO.

ModelFeature (s)Image countImage retrieval
R@1R@5R@10Med rR@1R@5R@10Med r

VSAR–CNN + BRNN38.469.980.5127.460.274.83
VSE++VGG + GRU + HNM43.674.884.6233.768.881.03
OursAug45.175.885.3233.867.480.23