×

GPU-acceleration for moving particle semi-implicit method. (English) Zbl 1271.76264

Summary: The MPS (Moving Particle Semi-implicit) method has been proven useful in computation free-surface hydrodynamic flows. Despite its applicability, one of its drawbacks in practical application is the high computational load. On the other hand, Graphics Processing Unit (GPU), which was originally developed for acceleration of computer graphics, now provides unprecedented capability for scientific computations.
The main objective of this study is to develop a GPU-accelerated MPS code using CUDA (Compute Unified Device Architecture) language. Several techniques have been shown to optimize calculations in CUDA. In order to promote the acceleration by GPU, particular attentions are given to both the search of neighboring particles and the iterative solution of simultaneous linear equations in the Poisson Pressure Equation.
In this paper, 2-dimensional calculations of elliptical drop evolution and dam break flow have been carried out by the GPU-accelerated MPS method, and the accuracy and performance of GPU-based code are investigated by comparing the results with those by CPU. It is shown that results of GPU-based calculations can be obtained much faster with the same reliability as the CPU-based ones.

MSC:

76M28 Particle methods and lattice-gas methods
65Y05 Parallel numerical computation

Software:

CUDA
PDFBibTeX XMLCite
Full Text: DOI Link Link

References:

[1] Bell N, Garland M. Efficient sparse matrix-vector multiplication on CUDA. NVIDIA Technical Report NVR-2008-004; 2008. p. 32.; Bell N, Garland M. Efficient sparse matrix-vector multiplication on CUDA. NVIDIA Technical Report NVR-2008-004; 2008. p. 32.
[2] Fixstars Corporation. NVIDIA CUDA Information Site. <http://gpu.fixstars.com/; Fixstars Corporation. NVIDIA CUDA Information Site. <http://gpu.fixstars.com/
[3] Gingold, R. A.; Monaghan, J. J., Kernel estimates as a basis for general particle methods in hydrodynamics, J Comp Phys, 46, 429-453 (1982) · Zbl 0487.76010
[4] Gotoh, H., Lagrangian particle method as advanced technology for numerical wave flume, Int J Offshore Polar Eng, 19, 3, 161-167 (2009)
[5] Gotoh, H.; Sakai, T., Key issues in the particle method for computation of wave breaking, Coast Eng, 53, 171-179 (2006)
[6] Gotoh, H.; Ikari, H.; Memita, T.; Sakai, T., Lagrangian particle method for simulation of wave overtopping on a vertical seawall, Coast Eng J, 47, 2-3, 157-181 (2005)
[7] Harada T, Masaie I, Koshizuka S, Kawaguchi Y. Accelerating particle-based simulations utilizing spatial locality on the GPU. Transactions of JSCES, Paper No. 20080016; 2008 [in Japanese].; Harada T, Masaie I, Koshizuka S, Kawaguchi Y. Accelerating particle-based simulations utilizing spatial locality on the GPU. Transactions of JSCES, Paper No. 20080016; 2008 [in Japanese].
[8] Harish P, Nrayanan PJ. Accelerating large graph algorithms on the GPU using CUDA, HiPC 2007. LNCS 2007;4873:197-208.; Harish P, Nrayanan PJ. Accelerating large graph algorithms on the GPU using CUDA, HiPC 2007. LNCS 2007;4873:197-208.
[9] Harris M. Optimizing parallel reduction in CUDA. NVIDIA CUDA SDK 2.0; 2008.; Harris M. Optimizing parallel reduction in CUDA. NVIDIA CUDA SDK 2.0; 2008.
[10] Hu, C. H.; Kashiwagi, M., A CIP method for numerical simulations of violent free surface flows, J Marine Sci Technol, 9, 4, 143-157 (2004)
[11] Ikari H, Gotoh H. Parallelization of MPS method for 3D wave analysis. In Advances in hydro-science and engineering, 8th international conference on hydro-science and engineering (ICHE). Nagoya, Japan; 2008.; Ikari H, Gotoh H. Parallelization of MPS method for 3D wave analysis. In Advances in hydro-science and engineering, 8th international conference on hydro-science and engineering (ICHE). Nagoya, Japan; 2008.
[12] Khayyer, A.; Gotoh, H., Development of CMPS method for accurate water-surface tracking in breaking waves, Coast Eng J, 20, 2, 179-207 (2008)
[13] Khayyer, A.; Gotoh, H., Modified moving particle semi-implicit methods for the prediction of 2D wave impact pressure, Coast Eng, 56, 4, 419-440 (2009)
[14] Khayyer, A.; Gotoh, H., A higher order Laplacian model for enhancement and stabilization of pressure calculation by the MPS method, Appl Ocean Res, 32, 1, 124-131 (2010)
[15] Khayyer, A.; Gotoh, H., Enhancement of stability and accuracy of the moving particle semi-implicit method, J Comp Phys, 230, 8, 3093-3118 (2011) · Zbl 1316.76084
[16] Khayyer, A.; Gotoh, H.; Shao, S. D., Corrected incompressible SPH method for accurate water-surface tracking in breaking waves, Coast Eng, 55, 3, 236-250 (2008)
[17] Koshizuka, S.; Oka, Y., Moving-particle semi-implicit method for fragmentation of incompressible fluid, Nucl Sci Eng, 123, 421-434 (1996)
[18] Liu, W.; Schmidt, B.; Voss, G.; Muller-Wittig, W., Accelerating molecular dynamics simulations using graphics processing units with CUDA, Comput Phys Commun, 179, 634-641 (2008)
[19] Maruyama N, Nukada A, Matsuoka S. Software-based ECC for GPUs. In: Presented at symposium on application accelerators in high performance computing. Urbana, Illinois, US; July 27-31, 2009.; Maruyama N, Nukada A, Matsuoka S. Software-based ECC for GPUs. In: Presented at symposium on application accelerators in high performance computing. Urbana, Illinois, US; July 27-31, 2009.
[20] McCabe C, Causon DM, Mingham CG. Graphics processing unit accelerated calculations of free surface flows using smoothed particle hydrodynamics. In: 4th International SPHERIC workshop. Nantes, France, May 27-29, 2009, pp. 384-91.; McCabe C, Causon DM, Mingham CG. Graphics processing unit accelerated calculations of free surface flows using smoothed particle hydrodynamics. In: 4th International SPHERIC workshop. Nantes, France, May 27-29, 2009, pp. 384-91.
[21] Monaghan, J. J., Simulating free surface flows with SPH, J Comput Phys, 110, 399-406 (1994) · Zbl 0794.76073
[22] NVIDIA. NVIDIA CUDA compute unified device architecture programming guide. Version 2.0; 2008.; NVIDIA. NVIDIA CUDA compute unified device architecture programming guide. Version 2.0; 2008.
[23] NVIDIA. Compute unified device architecture. <http://www.nvidia.com/object/cuda_home.html; NVIDIA. Compute unified device architecture. <http://www.nvidia.com/object/cuda_home.html
[24] Ogawa S, Aoki T. GPU Computing for 2-dimensional incompressible-flow Simulation based on Multigrid method. Transactions of JSCES, Paper No. 20090021; 2009 [in Japanese].; Ogawa S, Aoki T. GPU Computing for 2-dimensional incompressible-flow Simulation based on Multigrid method. Transactions of JSCES, Paper No. 20090021; 2009 [in Japanese].
[25] Rossinelli, D.; Koumoutsakos, P., Vortex methods for incompressible flow simulations on the GPU, Visual Comput, 24, 699-708 (2008)
[26] Rossinelli, D.; Bergdorf, M.; Cottet, G.-H.; Koumoutsakos, P., GPU accelerated simulations of bluff body flows using vortex particle methods, J Comp Phys, 229, 3316-3333 (2010) · Zbl 1307.76066
[27] Shao, S. D.; Gotoh, H., Turbulence particle models for tracking free surfaces, J Hydraul Res, 43, 3, 276-289 (2005)
[28] Shao, S. D.; Lo, E. Y.M., Incompressible SPH method for simulating Newtonian and non-Newtonian flows with a free surface, Adv Water Resour, 26, 7, 787-800 (2003)
[29] Takai Y, Nagai G. Fast calculation of conjugate gradient method by GPU. In: Proceedings of the conference on computational engineering and science, vol. 14. Tokyo, Japan, May 12-18, 2009, 283-284 [in Japanese].; Takai Y, Nagai G. Fast calculation of conjugate gradient method by GPU. In: Proceedings of the conference on computational engineering and science, vol. 14. Tokyo, Japan, May 12-18, 2009, 283-284 [in Japanese].
[30] Vazques F, Garzon EM, Martinez JA, Fernandez JJ. The sparse matrix vector product on GPUs. <http://www.ace.ual.es/TR/SpMV.GPU.pdf; Vazques F, Garzon EM, Martinez JA, Fernandez JJ. The sparse matrix vector product on GPUs. <http://www.ace.ual.es/TR/SpMV.GPU.pdf
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.