Research Article

Parallel Algorithms of Well-Balanced and Weighted Average Flux for Shallow Water Model Using CUDA

Figure 11

Different max threads per block (256, 512, and 1024) and min block per SM (1–8): (a) registers per thread, (b) occupancy, and (c) execution times.
(a)
(b)
(c)