Research Article

Parallel Algorithms of Well-Balanced and Weighted Average Flux for Shallow Water Model Using CUDA

Table 4

Running time and speedup for Srinakarin Dam break simulation.

Time (ms)Speedup compared with previous programSpeedup compared with baseline

Serial: cell-based29,156,017BaselineBaseline
Serial: edge-based15,845,6611.841.84
Parallel V11,689,7039.3817.26
Parallel occupancy647,0452.6145.06
Parallel memory pattern621,0901.0446.94
Parallel unroll508,6071.2257.33