Research Article

Hybrid MPI and CUDA Parallelization for CFD Applications on Multi-GPU HPC Clusters

Table 3

The runtime for GTX 1070 multi-GPU clusters.

No.      Two GPUs (ms)   Three GPUs (ms)   Four GPUs (ms)

Mesh 1   12.62           11.27             10.38
Mesh 2   20.86           18.19             16.01
Mesh 3   35.56           28.19             23.38
Mesh 4   67.55           52.43             42.33
Mesh 5   126.93          96.24             77.15
Mesh 6   252.01          187.84            144.43
Mesh 7   499.02          369.68            276.86
Mesh 8   -               -                 540.73
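
One way to read these runtimes is as a relative speedup: the two-GPU runtime divided by the four-GPU runtime for the same mesh, where 2.0 would be ideal scaling. The sketch below is only an illustration of that arithmetic. The metric, the variable name runtimes_ms, and the helper relative_speedup are assumptions introduced here, not taken from the article. The numbers are copied from Table 3, and the Mesh 8 entries that the table leaves blank are stored as None.

```python
# Runtimes in ms copied from Table 3 (GTX 1070), ordered as
# (two GPUs, three GPUs, four GPUs). None marks entries the table leaves blank.
runtimes_ms = {
    "Mesh 1": (12.62, 11.27, 10.38),
    "Mesh 2": (20.86, 18.19, 16.01),
    "Mesh 3": (35.56, 28.19, 23.38),
    "Mesh 4": (67.55, 52.43, 42.33),
    "Mesh 5": (126.93, 96.24, 77.15),
    "Mesh 6": (252.01, 187.84, 144.43),
    "Mesh 7": (499.02, 369.68, 276.86),
    "Mesh 8": (None, None, 540.73),
}


def relative_speedup(row):
    """Hypothetical helper: two-GPU runtime divided by four-GPU runtime.

    Returns None when either runtime is missing from the table.
    """
    two_gpus, _, four_gpus = row
    if two_gpus is None or four_gpus is None:
        return None
    return two_gpus / four_gpus


speedup = {mesh: relative_speedup(row) for mesh, row in runtimes_ms.items()}

for mesh, value in speedup.items():
    if value is None:
        print(f"{mesh}: not available (no two-GPU runtime reported)")
    else:
        print(f"{mesh}: {value:.2f}x from two to four GPUs")
```

Under this reading the ratio rises steadily with mesh size, from roughly 1.2 for Mesh 1 to roughly 1.8 for Mesh 7, so doubling the GPU count pays off most on the larger meshes.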