Research Article

Hybrid MPI and CUDA Parallelization for CFD Applications on Multi-GPU HPC Clusters

Table 1

Main performance parameters of GPUs.

Parameter                     GTX 1070     Tesla V100
Date introduced               June 2016    January 2018
Architecture                  Pascal       Volta
Computation capability        6.1          7.0
Device memory (GB)            8            32
Streaming multiprocessors     15           80
Stream processors             1,920        5,120
Single precision (TFLOP/s)    6            14
Double precision (TFLOP/s)    0.2          7
Memory bandwidth (GB/s)       256          900
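
The figures in Table 1 are easier to compare once a few derived quantities are computed. The following is a minimal sketch that uses only the values listed in the table; the dictionary layout and the helper names (cores_per_sm, dp_to_sp_ratio, v100_over_gtx1070) are illustrative assumptions, not part of the original work.

```python
# Values copied from Table 1. Key names are hypothetical, chosen for this sketch.
GPUS = {
    "GTX 1070": {
        "sms": 15, "cores": 1920,
        "sp_tflops": 6.0, "dp_tflops": 0.2, "bw_gbs": 256.0,
    },
    "Tesla V100": {
        "sms": 80, "cores": 5120,
        "sp_tflops": 14.0, "dp_tflops": 7.0, "bw_gbs": 900.0,
    },
}


def cores_per_sm(gpu):
    """Stream processors per streaming multiprocessor."""
    return gpu["cores"] // gpu["sms"]


def dp_to_sp_ratio(gpu):
    """Peak double-precision throughput as a fraction of single precision."""
    return gpu["dp_tflops"] / gpu["sp_tflops"]


def v100_over_gtx1070(metric):
    """Ratio of the Tesla V100 value to the GTX 1070 value for one metric."""
    return GPUS["Tesla V100"][metric] / GPUS["GTX 1070"][metric]


if __name__ == "__main__":
    for name, gpu in GPUS.items():
        print(f"{name}: {cores_per_sm(gpu)} cores/SM, "
              f"DP/SP = {dp_to_sp_ratio(gpu):.3f}")
    for metric in ("sp_tflops", "dp_tflops", "bw_gbs"):
        print(f"V100 / GTX 1070 ({metric}): {v100_over_gtx1070(metric):.2f}x")
```

With the table values, the peak double-precision gap between the two cards (about 35x) is far larger than the single-precision gap (about 2.3x) or the memory-bandwidth gap (about 3.5x), so the precision used by a solver largely determines which column is relevant.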