Research Article
Parallel Algorithms of Well-Balanced and Weighted Average Flux for Shallow Water Model Using CUDA
Table 2
Speedup of each program version compared with the previous version, for various test problems.
| Program version | Rectangular dam break, 16k | Rectangular dam break, 1M | Circular dam break on wet bed, 16k | Circular dam break on wet bed, 1M | Circular dam break on dry bed, 16k | Circular dam break on dry bed, 1M | Dam break flows over three humps, 16k | Dam break flows over three humps, 1M |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Serial: cell-based | Baseline | Baseline | Baseline | Baseline | Baseline | Baseline | Baseline | Baseline |
| Serial: edge-based | 1.90 | 1.92 | 1.87 | 1.24 | 1.92 | 1.52 | 1.74 | 1.81 |
| Parallel V1 | 6.55 | 9.70 | 11.19 | 15.81 | 9.92 | 14.05 | 7.91 | 9.67 |
| Parallel occupancy | 2.42 | 2.79 | 1.81 | 2.02 | 1.80 | 2.05 | 2.32 | 2.50 |
| Parallel memory pattern | 1.33 | 1.02 | 1.27 | 1.01 | 1.24 | 1.01 | 0.99 | 1.01 |
| Parallel unroll | 1.19 | 1.20 | 1.20 | 1.23 | 1.18 | 1.22 | 1.26 | 1.15 |
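Because the caption reports each speedup relative to the previous program, a cumulative speedup over the cell-based serial baseline can be estimated by multiplying the stepwise factors. The short Python sketch below does this for one column of Table 2 (rectangular dam break, 1M). It assumes that "previous program" means the row directly above and that the stages compose multiplicatively; the names `stepwise` and `cumulative_speedup` are illustrative and do not come from the article.

```python
from math import prod

# Stepwise speedups copied from Table 2 (rectangular dam break, 1M column).
# Assumption: each factor is measured against the program in the row above it.
stepwise = {
    "Serial: edge-based": 1.92,
    "Parallel V1": 9.70,
    "Parallel occupancy": 2.79,
    "Parallel memory pattern": 1.02,
    "Parallel unroll": 1.20,
}


def cumulative_speedup(factors):
    """Multiply stepwise speedup factors into one overall factor."""
    return prod(factors)


total = cumulative_speedup(stepwise.values())
print(f"{total:.1f}")  # prints 63.6
```

Under these assumptions the final parallel version would be roughly 64 times faster than the cell-based serial baseline for this case. Swapping in the factors from another column gives the corresponding estimate for that problem.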