Research Article
Automatic Pipelining and Vectorization of Scientific Code for FPGAs
Figure 12
Speedup achieved over nonvectorized OpenCL baseline plotted against vectorization factor, for the second example (Coriolis). The TyTra solutions are OpenCL-HDL hybrids, and the complete host API, shell, and kernel code for all variants is generated automatically from TIR description. The speedup is calculated for 3 different grid sizes and time steps (legend shows dimension1 × dimension2 × time steps).