Research Article

An FPGA-Based Hardware Accelerator for CNNs Using On-Chip Memories Only: Design and Benchmarking with Intel Movidius Neural Compute Stick

Table 1

Convolutional parameters for the network.

LayerInput matrixFilterCinCoutOutput matrix

Hidden layer 0Time_063 × 135 × 11159 × 13
Freq_059 × 131 × 31859 × 11
Hidden layer 1Time_159 × 115 × 18855 × 11
Freq_155 × 111 × 381655 × 9
Hidden layer 2Time_255 × 911 × 1161645 × 9
Freq_245 × 91 × 31619245 × 7
Final_conv45 × 71 × 11921245 × 7