Research Article
OpenCL-Based FPGA Accelerator for 3D FDTD with Periodic and Absorbing Boundary Conditions
Table 2
Comparison with GPUs and CPUs using single-precision floating-point computation.
| | FPGA | GPU | Multicore CPU | DE5 | 395-D8 | GTX680 | GTX750Ti | i7-4960x | E5-1650 v3 |
| Number of | — | — | 1152 | 1024 | 6 | 6 | Core clock frequency (MHz) | 260 | 193 | 980 | 1127 | 3600 | 3500 | Memory bandwidth (GB/s) | 25.6 | 34.1 | 192.2 | 86.4 | 51.2 | 59.7 | Peak performance (Gflop/s) | 193 | 1502.9 | 3090 | 1305 | 345.6 | 672 |
| Processing time (s) | 6.31 | 7.36 | 9.39 | 10.71 | 23.63 | 20.84 |
|
|
: CUDA cores; multicore CPU: CPU cores.
|