Multi-GPU Immersed Boundary Method Hemodynamics Simulations

J Comput Sci. 2020 Jul;44:101153. doi: 10.1016/j.jocs.2020.101153. Epub 2020 Jun 14.

Abstract

Large-scale simulations of blood flow that resolve the 3D deformation of each constituent cell are increasingly popular owing to algorithmic developments in conjunction with advances in compute capability. Among the different approaches for modeling cell-resolved hemodynamics, fluid-structure interaction (FSI) algorithms based on the immersed boundary method are frequently employed to couple separate solvers for the background fluid and the cells within a single framework. GPUs can accelerate these simulations; however, both current pre-exascale and future exascale CPU-GPU heterogeneous systems face communication challenges that are critical to performance and scalability. We describe, to our knowledge, the largest distributed GPU-accelerated FSI simulations of high-hematocrit cell-resolved flows, with over 17 million red blood cells. We compare scaling on a fat-node system with six GPUs per node and on a system with a single GPU per node. Through comparison between the CPU- and GPU-based implementations, we identify the cost of data movement in multiscale, multi-grid FSI simulations on heterogeneous systems and show it to be the greatest performance bottleneck on the GPU.

Keywords: GPU; distributed parallelization; fluid structure interaction; immersed boundary method; lattice Boltzmann method.
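
To make the fluid-structure coupling concrete, the sketch below shows the velocity-interpolation step of a generic immersed boundary method: Eulerian fluid velocities are gathered onto Lagrangian membrane markers through a regularized delta function. This is a minimal NumPy illustration, assuming Peskin's standard 4-point cosine kernel, a periodic grid with unit spacing, and hypothetical function names (`delta_1d`, `interpolate_velocity`); it is not the paper's GPU implementation.

```python
import numpy as np

def delta_1d(r):
    """Peskin's 4-point cosine regularized delta function (1D, grid units)."""
    r = np.abs(r)
    phi = np.zeros_like(r)
    mask = r < 2.0
    phi[mask] = 0.25 * (1.0 + np.cos(np.pi * r[mask] / 2.0))
    return phi

def interpolate_velocity(u_grid, markers):
    """Interpolate the Eulerian velocity field u_grid (nx, ny, nz, 3) onto
    Lagrangian markers (m, 3). Coordinates are in grid units (spacing h = 1),
    so the h^3 quadrature weight and the 1/h^3 kernel scaling cancel."""
    u_markers = np.zeros_like(markers)
    offs = np.arange(4)
    for k, x in enumerate(markers):
        base = np.floor(x).astype(int) - 1                 # corner of 4x4x4 support
        coords = [base[d] + offs for d in range(3)]        # unwrapped grid coordinates
        # Tensor-product kernel weights over the 4x4x4 stencil.
        w = (delta_1d(coords[0] - x[0])[:, None, None]
             * delta_1d(coords[1] - x[1])[None, :, None]
             * delta_1d(coords[2] - x[2])[None, None, :])
        idx = [c % u_grid.shape[d] for d, c in enumerate(coords)]  # periodic wrap
        patch = u_grid[np.ix_(idx[0], idx[1], idx[2])]             # (4, 4, 4, 3)
        u_markers[k] = np.tensordot(w, patch, axes=3)
    return u_markers
```

The reverse operation, spreading membrane forces back to the fluid grid, reuses the same stencil weights as the adjoint of this interpolation; in a distributed setting these gather/scatter stencils are exactly where Eulerian-Lagrangian data exchange concentrates.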