Table of Contents Author Guidelines Submit a Manuscript
Scientific Programming
Volume 2019, Article ID 4254676, 12 pages
https://doi.org/10.1155/2019/4254676
Research Article

Implementation and Optimization of a CFD Solver Using Overlapped Meshes on Multiple MIC Coprocessors

1College of Computer and Information Technology, Xinyang Normal University, Henan 464200, China
2Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China

Correspondence should be addressed to Wenpeng Ma; nc.ude.unyx@pwam

Received 2 February 2019; Revised 27 March 2019; Accepted 23 April 2019; Published 27 May 2019

Academic Editor: Basilio B. Fraguela

Copyright © 2019 Wenpeng Ma et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Linked References

  1. NVIDIA, NVIDIA Tesla V100 GPU Architecture, NVIDIA, Santa Clara, CA, USA, 2017.
  2. https://www.intel.com/content/www/us/en/architecture-and-technology/many-integrated-core/intel-many-integrated-core-architecture.html.
  3. A. Gorobets, F. Trias, R. Borrell, G. Oyarzun, and A. Oliva, “Direct numerical simulation of turbulent flows with parallel algorithms for various computing architectures,” in Proceedings of the 6th European Conference on Computational Fluid Dynamics, Barcelona, Spain, July 2014.
  4. M. A. A. Farhan, D. K. Kaushik, and D. E. Keyes, “Unstructured computational aerodynamics on many integrated core architecture,” Parallel Computing, vol. 59, pp. 97–118, 2016. View at Publisher · View at Google Scholar · View at Scopus
  5. J. S. Graf, M. K. Gobbert, and S. Khuvis, “Long-time simulations with complex code using multiple nodes of Intel Xeon Phi knights landing,” Journal of Computational and Applied Mathematics, vol. 337, pp. 18–36, 2018. View at Publisher · View at Google Scholar · View at Scopus
  6. Y. Cai, G. Li, and W. Liu, “Parallelized implementation of an explicit finite element method in many integrated core (MIC) architecture,” Advances in Engineering Software, vol. 116, pp. 50–59, 2018. View at Publisher · View at Google Scholar · View at Scopus
  7. S. Saini, H. Jin, D. Jespersen et al., “Early multi-node performance evaluation of a knights corner (KNC) based NASA supercomputer,” in Proceedings of the IEEE International Parallel & Distributed Processing Symposium Workshop, Chicago, FL, USA, May 2015.
  8. Y. X. Wang, L. L. Zhang, W. Liu, X. H. Cheng, Y. Zhuang, and A. T. Chronopoulos, “Performance optimizations for scalable CFD applications on hybrid CPU+MIC heterogeneous computing system with millions of cores,” Computers & Fluids, vol. 173, pp. 226–236, 2018. View at Publisher · View at Google Scholar · View at Scopus
  9. K. Banaś, F. Krużel, and J. Bielański, “Finite element numerical integration for first order approximations on multi-and many-core architectures,” Computer Methods in Applied Mechanics and Engineering, vol. 305, pp. 827–848, 2016. View at Publisher · View at Google Scholar · View at Scopus
  10. W. C. Schneck, E. D. Gregory, and C. A. C. Leckey, “Optimization of elastodynamic finite integration technique on Intel Xeon Phi knights landing processors,” Journal of Computational Physics, vol. 374, pp. 550–562, 2018. View at Publisher · View at Google Scholar · View at Scopus
  11. J. M. Cebrián, J. M. Cecilia, M. Hernández, and J. M. García, “Code modernization strategies to 3-D stencil-based applications on Intel Xeon Phi: KNC and KNL,” Computers & Mathematics with Applications, vol. 74, no. 10, pp. 2557–2571, 2017. View at Publisher · View at Google Scholar · View at Scopus
  12. M. Lukas, Z. Jan, M. Michal et al., “Evaluation of the Intel Xeon Phi offload runtimes for domain decomposition solvers,” Advances in Engineering Software, vol. 125, pp. 46–154, 2018. View at Google Scholar
  13. S. M. I. Gohari, V. Esfahanian, and H. Moqtaderi, “Coalesced computations of the incompressible Navier–Stokes equations over an airfoil using graphics processing units,” Computers & Fluids, vol. 80, no. 1, pp. 102–115, 2013. View at Publisher · View at Google Scholar · View at Scopus
  14. L. Fu, K. Z. Gao, and F. Xu, “A multi-block viscous flow solver based on GPU parallel methodology,” Computers & Fluids, vol. 95, pp. 19–39, 2014. View at Publisher · View at Google Scholar · View at Scopus
  15. W. Cao, C. F. Xu, Z. H. Wang, H. Y. Liu, and H. Y. Liu, “CPU/GPU computing for a multi-block structured grid based high-order flow solver on a large heterogeneous system,” Cluster Computing, vol. 17, no. 2, pp. 255–270, 2014. View at Publisher · View at Google Scholar · View at Scopus
  16. M. Aissa, T. Verstraete, and C. Vuik, “Toward a GPU-aware comparison of explicit and implicit CFD simulations on structured meshes,” Computers & Mathematics with Applications, vol. 74, no. 1, pp. 201–217, 2017. View at Publisher · View at Google Scholar · View at Scopus
  17. C. Xu, X. Deng, L. Zhang et al., “Collaborating CPU and GPU for large-scale high-order CFD simulations with complex grids on the TianHe-1A supercomputer,” Journal of Computational Physics, vol. 278, pp. 275–297, 2014. View at Publisher · View at Google Scholar · View at Scopus
  18. A. Corrigan, F. F. Camelli, R. Löhner, and J. Wallin, “Running unstructured grid-based CFD solvers on modern graphics hardware,” International Journal for Numerical Methods in Fluids, vol. 66, no. 2, pp. 221–229, 2011. View at Publisher · View at Google Scholar · View at Scopus
  19. A. Lacasta, M. Morales-Hernández, J. Murillo, and P. García-Navarro, “An optimized GPU implementation of a 2D free surface simulation model on unstructured meshes,” Advances in Engineering Software, vol. 78, pp. 1–15, 2014. View at Publisher · View at Google Scholar · View at Scopus
  20. P. Barrio, C. Carreras, R. Robles, A. L. Juan, R. Jevtic, and R. Sierra, “Memory optimization in FPGA-accelerated scientific codes based on unstructured meshes,” Journal of Systems Architecture, vol. 60, no. 7, pp. 579–591, 2014. View at Publisher · View at Google Scholar · View at Scopus
  21. Y. Xia, H. Luo, M. Frisbey, and R. Nourgaliev, “A set of parallel, implicit methods for a reconstructed discontinuous Galerkin method for compressible flows on 3D hybrid grids,” in Proceedings of the 7th AIAA Theoretical Fluid Mechanics Conference, Atlanta, GA, USA, 2014.
  22. J. Langguth, N. Wu, J. Chai, and X. Cai, “Parallel performance modeling of irregular applications in cell-centered finite volume methods over unstructured tetrahedral meshes,” Journal of Parallel and Distributed Computing, vol. 76, pp. 120–131, 2015. View at Publisher · View at Google Scholar · View at Scopus
  23. K. Soni, D. D. J. Chandar, and J. Sitaraman, “Development of an overset grid computational fluid dynamics solver on graphical processing units,” Computers & Fluids, vol. 58, pp. 1–14, 2012. View at Publisher · View at Google Scholar · View at Scopus
  24. D. D. J. Chandar, J. Sitaraman, and D. Mavriplis, “GPU parallelization of an unstructured overset grid incompressible Navier–Stokes solver for moving bodies,” in Proceedings of the 50th AIAA Aerospace Sciences Meeting Including the New Horizons Forum and Aerospace Exposition, Nashville, TN, USA, January 2012.
  25. D. Chandar, J. Sitaraman, and D. Mavriplis, “Dynamic overset grid computations for CFD applications on graphics processing units,” in Proceedings of the Seventh International Conference on Computational Fluid Dynamics, Big Island, Hawaii, July 2012.
  26. K. E. Niemeyer and C.-J. Sung, “Recent progress and challenges in exploiting graphics processors in computational fluid dynamics,” Journal of Supercomputing, vol. 67, no. 2, pp. 528–564, 2014. View at Publisher · View at Google Scholar · View at Scopus
  27. G. Edgar, E. F. Graham, B. George et al., “Open MPI: goals, concept, and design of a next generation MPI implementation,” in Proceedings of the 11th European PVM/MPI Users? Group Meeting, pp. 97–104, Budapest, Hungary, September 2004, http://www.open-mpi.org.
  28. M. J. Djomehri and H. Jin, “Hybrid MPI+OpenMP programming of an overset CFD solver and performance investigations,” NASA Ames Research Center, Moffett Field, CA, USA, 2002, NASA Technical Report. View at Google Scholar
  29. B. Chapman, G. Jost, and R. van der Pass, Using OpenMP: Portable Shared Memory Parallel Programming, The MIT Press, Cambridge, MA, USA, 2007.
  30. N. C. Prewitt, D. M. Belk, and W. Shyy, “Parallel computing of overset grids for aerodynamic problems with moving objects,” Progress in Aerospace Sciences, vol. 36, no. 2, pp. 117–172, 2000. View at Publisher · View at Google Scholar · View at Scopus
  31. B. Roget and J. Sitaraman, “Robust and efficient overset grid assembly for partitioned unstructured meshes,” Journal of Computational Physics, vol. 260, pp. 1–24, 2014. View at Publisher · View at Google Scholar · View at Scopus
  32. G. Zagaris, M. T. Campbell, D. J. Bodony et al., “A toolkit for parallel overset grid assembly targeting large-scale moving body aerodynamic simulations,” in Proceedings of the 19th International Meshing Roundtable, pp. 385–401, Springer, Berlin, Heidelberg, October 2010.
  33. J. Cai, F. Tsai, and F. Liu, “A parallel viscous flow solver on multi-block overset grids,” Computers & Fluids, vol. 35, no. 10, pp. 1290–1301, 2006. View at Publisher · View at Google Scholar · View at Scopus
  34. B. Landmann and M. Montagnac, “A highly automated parallel Chimera method for overset grids based on the implicit hole cutting technique,” International Journal for Numerical Methods in Fluids, vol. 66, no. 6, pp. 778–804, 2011. View at Publisher · View at Google Scholar · View at Scopus
  35. W. D. Henshaw, “Solving fluid flow problems on moving and adaptive overlapping grids,” in Proceedings of the International Conference on Parallel Computational Fluid Dynamics, Washington, DC, USA, May 2005.
  36. W. Liao, J. Cai, and H. M. Tsai, “A multigrid overset grid flow solver with implicit hole cutting method,” Computer Methods in Applied Mechanics and Engineering, vol. 196, no. 9–12, pp. 1701–1715, 2007. View at Publisher · View at Google Scholar · View at Scopus
  37. https://ark.intel.com/products/71992/Intel-Xeon-Phi-Coprocessor-5110P-8GB-1-053-GHz-60-core-.
  38. E. Wang, Q. Zhang, B. Shen et al., High-Performance Computing on the Intel® Xeon Phi™, How to Fully Exploit MIC Architectures, Springer, Berlin, Germany, 2014.
  39. P. L. Roe, “Approximate Riemann solvers, parameter vectors, and difference schemes,” Journal of Computational Physics, vol. 43, no. 2, pp. 357–372, 1981. View at Publisher · View at Google Scholar · View at Scopus
  40. A. Jameson, W. Schmidt, and E. Trukel, “Numerical solutions of the Euler equations by finite volume methods using Runge–Kutta time-stepping schemes,” in Proceedings of the 14th Fluid and Plasma Dynamics Conference AIAA Paper, Palo Alto, CA, USA, 1981.
  41. T. H. Pulliam and D. S. Chaussee, “A diagonal form of an implicit approximate-factorization algorithm,” Journal of Computational Physics, vol. 39, no. 2, pp. 347–363, 1981. View at Publisher · View at Google Scholar · View at Scopus
  42. J. Blazek, Computational Fluid Dynamics: Principles and Applications, Elsevier, Amsterdam, Netherlands, 2nd edition, 2005.
  43. Z. Wang, N. Hariharan, and R. Chen, “Recent developments on the conservation property of chimera,” in Proceedings of the 36th AIAA Aerospace Sciences Meeting and Exhibit, Reno, NV, USA, January 1998. View at Publisher · View at Google Scholar
  44. R. L. Meakin, “On the spatial and temporal accuracy of overset grid methods for moving body problems,” in Proceedings of the 12th Applied Aerodynamics Conference AIAA Paper 1994-1925, Colorado Springs, CO, USA, June 1994.
  45. S. E. Rogers, N. E. Suhs, and W. E. Dietz, “PEGASUS 5: an automated preprocessor for overset-grid computational fluid dynamics,” AIAA Journal, vol. 41, no. 6, pp. 1037–1045, 2003. View at Publisher · View at Google Scholar · View at Scopus
  46. W. Ma, X. Hu, and X. Liu, “Parallel multibody separation simulation using MPI and OpenMP with communication optimization,” Journal of Algorithms & Computational Technology, vol. 13, pp. 1–17, 2018. View at Publisher · View at Google Scholar
  47. Y. Wu, “Numerical simulation and Aerodynamic effect research for multi-warhead projection,” Journal of System Simulation, vol. 28, no. 7, pp. 1552–1560, 2016, in Chinese. View at Google Scholar