Research Article

Large-Scale Video Retrieval via Deep Local Convolutional Features

Table 1: Structure of VGG16.

Layer          Output size

Conv3-64       224 × 224 × 64
Conv3-64       224 × 224 × 64

Max-pooling    112 × 112 × 64
Conv3-128      112 × 112 × 128
Conv3-128      112 × 112 × 128

Max-pooling    56 × 56 × 128
Conv3-256      56 × 56 × 256
Conv3-256      56 × 56 × 256
Conv3-256      56 × 56 × 256

Max-pooling    28 × 28 × 256
Conv3-512      28 × 28 × 512
Conv3-512      28 × 28 × 512
Conv3-512      28 × 28 × 512

Max-pooling    14 × 14 × 512
Conv3-512      14 × 14 × 512
Conv3-512      14 × 14 × 512
Conv3-512      14 × 14 × 512

Max-pooling    7 × 7 × 512
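
The output sizes in Table 1 can be reproduced with a short shape calculation. The sketch below is illustrative only and is not taken from the article: it assumes a 224 × 224 RGB input, 3 × 3 convolutions with stride 1 and padding 1 (which preserve spatial size), and 2 × 2 max-pooling with stride 2 (which halves it). The names `VGG16_CFG` and `vgg16_output_sizes` are hypothetical.

```python
# Illustrative shape calculation for the convolutional part of VGG16.
# Assumptions (not stated in the table itself): 224x224x3 input,
# 3x3 conv with stride 1 and padding 1, 2x2 max-pooling with stride 2.

# Integers are the output channels of a 3x3 convolution ("Conv3-C");
# "M" marks a max-pooling layer.
VGG16_CFG = [64, 64, "M",
             128, 128, "M",
             256, 256, 256, "M",
             512, 512, 512, "M",
             512, 512, 512, "M"]


def vgg16_output_sizes(height=224, width=224, cfg=VGG16_CFG):
    """Return a list of (layer_name, (H, W, C)) tuples, one per layer."""
    sizes = []
    channels = 3  # RGB input
    for item in cfg:
        if item == "M":
            # 2x2 pooling, stride 2: spatial size is halved.
            height, width = height // 2, width // 2
            name = "Max-pooling"
        else:
            # 3x3 conv, stride 1, padding 1: (H + 2*1 - 3) // 1 + 1 == H.
            height = (height + 2 * 1 - 3) // 1 + 1
            width = (width + 2 * 1 - 3) // 1 + 1
            channels = item
            name = f"Conv3-{item}"
        sizes.append((name, (height, width, channels)))
    return sizes


if __name__ == "__main__":
    for name, (h, w, c) in vgg16_output_sizes():
        print(f"{name:<12} {h} x {w} x {c}")
```

Under these assumptions, the five pooling stages reduce the spatial resolution from 224 to 7, while the channel count grows from 64 to 512, matching the rows of the table.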