Research Article
Scalable Parallel Algorithm of Multiple-Relaxation-Time Lattice Boltzmann Method with Large Eddy Simulation on Multi-GPUs
Algorithm 3
Multi-GPUs of MRT-LBM-LES.
Read grid file. | Domain decomposition. | Memory allocation on host and device. | Initialization. | Copy data from host to device. | Judgement of the lattice style. | Iterative computation until satisfying convergence condition on GPUs. | Read data from global memory and propagation. | Deal with boundary condition. | Calculate macroscopic quantities. | Collision and write data back. | Data exchange of outer subdomain using MPI. | Copy data from device to host. | Gather and write data back to host memory. |
|