Research Article

An Enhanced Visual Attention Siamese Network That Updates Template Features Online

Table 1

Network structure and operations corresponding to each network block (block represents network block, Gold-SPool represents golden stochastic pooling, dilation represents dilated convolution, ResNet in Figure 1 includes Block1, Block2, Block3, Block4, and Block5, and “—” represents no operation).

BlockOperationTemplate sizeSearch size

127 × 127 × 3255 × 255 × 3

Block17 × 7, 64, 3 × 3Gold-SPool, s = 231 × 31 × 6462 × 62 × 64

Block215 × 15 × 25631 × 31 × 256

Block3 + dilation15 × 15 × 51231 × 31 × 512

Attention15 × 15 × 25631 × 31 × 256

Block4 + dilation15 × 15 × 102431 × 31 × 1024

Attention15 × 15 × 51231 × 31 × 512

Block5 + dilation15 × 15 × 204831 × 31 × 2048

Attention15 × 15 × 102431 × 31 × 1024