Research Article

The Case for Higher Computational Density in the Memory-Bound FDTD Method within Multicore Environments

Table 2

GPU kernel throughput for the standard FDTD algorithm at several thread-block size configurations.

MCells/s

4 × 4 × 4177
8 × 8 × 8221
16 × 8 × 4301
16 × 8 × 8249
32 × 8 × 4350