Research Article

Automatic Pipelining and Vectorization of Scientific Code for FPGAs

Figure 12

Speedup achieved over nonvectorized OpenCL baseline plotted against vectorization factor, for the second example (Coriolis). The TyTra solutions are OpenCL-HDL hybrids, and the complete host API, shell, and kernel code for all variants is generated automatically from TIR description. The speedup is calculated for 3 different grid sizes and time steps (legend shows dimension1 × dimension2 × time steps).