Research Article

High Performance Implementation of 3D Convolutional Neural Networks on a GPU

Table 2

Convolution layers of a 3D network. The filter size in all layers is 3 × 3 × 3, and the GFLOPS column gives the number of floating-point operations (in billions) in each convolutional layer. The batch size is assumed to be 32.

Layer   Input (channels × depth × height × width × batch)   Filters   GFLOPS

conv1   3 × 16 × 112 × 112 × 32                             32        16.65
conv2   32 × 16 × 56 × 56 × 32                              64        88.8
conv3   64 × 8 × 28 × 28 × 32                               256       88.8
conv4   256 × 4 × 14 × 14 × 32                              256       44.4
conv5   256 × 2 × 7 × 7 × 32                                256       5.55
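
The GFLOPS figures in Table 2 can be checked with a short calculation. The sketch below is not taken from the article. It assumes a 3 × 3 × 3 filter, an output volume with the same depth, height, and width as the input, and one multiply-accumulate counted as a single operation. The names `LAYERS`, `KERNEL_VOLUME`, and `conv3d_gflop` are illustrative only.

```python
# Layer shapes as read from Table 2:
# (name, input channels, depth, height, width, batch size, filters)
LAYERS = [
    ("conv1", 3, 16, 112, 112, 32, 32),
    ("conv2", 32, 16, 56, 56, 32, 64),
    ("conv3", 64, 8, 28, 28, 32, 256),
    ("conv4", 256, 4, 14, 14, 32, 256),
    ("conv5", 256, 2, 7, 7, 32, 256),
]

# Assumption: every filter spans 3 x 3 x 3 input positions.
KERNEL_VOLUME = 3 * 3 * 3


def conv3d_gflop(channels, depth, height, width, batch, filters,
                 kernel_volume=KERNEL_VOLUME):
    """Operation count of one 3D convolution layer, in billions.

    Assumes the output volume has the same depth, height, and width as
    the input, and counts each multiply-accumulate as one operation.
    """
    outputs = filters * depth * height * width * batch  # output elements
    ops_per_output = channels * kernel_volume           # MACs per element
    return outputs * ops_per_output / 1e9


if __name__ == "__main__":
    for name, *shape in LAYERS:
        print(f"{name}: {conv3d_gflop(*shape):.2f}")
```

Under these assumptions the computed values agree with the GFLOPS column to the precision shown in the table. Counting the multiply and the add as separate operations would double every figure.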