Research Article

Parallelization of an Unsteady ALE Solver with Deforming Mesh Using OpenACC

Table 4

Performance comparisons of multicore CPU and GPU results on mesh deformation algorithm.

2D case Single CPU coreGPU20 OpenMP threads

5672268 nodesMatrix assembly9.8 s0.23 s0.7 s
11117646 trianglesP-BiCGSTAB52 s1.63 s4.19 s

3D caseSingle CPU coreGPU20 OpenMP threads

4127686 nodesMatrix assembly12.6 s0.33 s0.78 s
24023132 tetrahedronsP-BiCGSTAB58.1 s1.84 s4.83 s