Research Article
Parallel Algorithms of Well-Balanced and Weighted Average Flux for Shallow Water Model Using CUDA
Figure 11
Different max threads per block (256, 512, and 1024) and min block per SM (1–8): (a) registers per thread, (b) occupancy, and (c) execution times.
(a) |
(b) |
(c) |