×

Parallel technology for numerical modeling of fluid dynamics problems by high-accuracy algorithms. (English. Russian original) Zbl 1320.76081

Comput. Math. Math. Phys. 55, No. 4, 638-649 (2015); translation from Zh. Vychisl. Mat. Mat. Fiz. 55, No. 4, 641-652 (2015).
Summary: A parallel computation technology for modeling fluid dynamics problems by finite-volume and finite-difference methods of high accuracy is presented. The development of an algorithm, the design of a software implementation, and the creation of parallel programs for computations on large-scale computing systems are considered. The presented parallel technology is based on a multilevel parallel model combining various types of parallelism: with shared and distributed memory and with multiple and single instruction streams to multiple data flows.

MSC:

76M12 Finite volume methods applied to problems in fluid mechanics
76M20 Finite difference methods applied to problems in fluid mechanics
68W10 Parallel algorithms in computer science
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Ramirez, A., The Mont-Blanc architecture (2012)
[2] V. V. Voevodin and Vl. V. Voevodin, Parallel Computations (BKhV-Peterburg, St. Petersburg, 2002) [in Russian]. · Zbl 0825.65135
[3] A. Grama, G. Karypis, V. Kumar, and A. Gupta, Introduction to Parallel Computing, 2nd ed. (Addison-Wesley, Reading, MA, 2003). · Zbl 0861.68040
[4] Yu. B. Vasilevskii, I. N. Kon’shin, G. V. Kopytov, and K. M. Terekhov, INMOST: Program Platform and Graphic Environment for Development of Parallel Numerical Models on General Grids (Mosk. Gos. Univ., Moscow, 2013) [in Russian].
[5] R. Aubry, G. Houzeaux, M. Vazquez, and J. M. Cela, “Some useful strategies for unstructured edge-based solvers on shared memory machines,” Int. J. Numerical Meth. Eng. 85, 537-561 (2010). · Zbl 1217.76066
[6] P. B. Bogdanov, A. A. Efremov, A. V. Gorobets, and S. A. Sukov, “Application of a scheduler for efficient data exchange on hybrid supercomputers with massively parallel accelerators,” Vychisl. Metody Program. 14, 122-134 (2013).
[7] MPI: A Message-Passing Interface Standard, Version 3.0, Message Passing Interface, Forum, September 21, 2012 (http://www.mpi-forum.org/docs/mpi-3.0/mpi30-report.pdf).
[8] OpenMP Application Program Interface, Version 4.0, July 2013 (http://www.openmp.org/mp-documents/OpenMP4.0.0.pdf). · Zbl 1356.76139
[9] Bogdanov, P.; Efremov, A. A., Programming infrastructure of heterogeneous computing based on OpenCL and its applications (2013)
[10] P. B. Bogdanov, A. V. Gorobets, and S. A. Sukov, “Adaptation and optimization of basic operations for an unstructured mesh CFD algorithm for computation on massively parallel accelerators,” Comput. Math. Math. Phys. 53(8), 1383-1394 (2013). · Zbl 1313.76084
[11] Barth, T., Numerical methods for conservation laws on structured and unstructured meshes (2003)
[12] I. V. Abalakin and T. K. Kozubskaya, “A scheme based on edge-oriented quasi-1D reconstruction of variables for solving aerodynamics and aeroacoustics problems on unstructured meshes,” Mat. Model. 25(8), 109-136 (2013). · Zbl 1356.76139
[13] P. A. Bakhvalov, “Quasi-one-dimensional reconstruction scheme on convex polygonal meshes for solving aeroacoustics problems,” Mat. Model. 25, 95-108 (2013). · Zbl 1356.76298
[14] A. Gorobets, F. X. Trias, and A. Oliva, “A parallel MPI + OpenMP + OpenCL algorithm for hybrid supercomputations of incompressible flows,” Comput. Fluids 88, 764-772 (2013). · Zbl 1391.76090
[15] M. Soria, C. D. Perez-Segarra, and A. Oliva, “A direct parallel algorithm for the efficient solution of the pressure-correction equation of incompressible flow problems using loosely coupled computers,” Numer. Heat Transfer Part B 41, 117-138 (2002).
[16] K. Schloegel, G. Karypis, and V. Kumar, “Parallel static and dynamic multi-constraint graph partitioning,” Concurrency Comput. Practice Experience 14(3), 219-240 (2002). · Zbl 1012.68146
[17] F. Pellegrini, PT-Scotch and libPTScotch 6.0 User’s Guide, 2012 (https://gforge.iniia.fr/docman/view.php/248/8261/ptscotch-user6.0.pdf).
[18] E. N. Golovchenko, “Parallel software package for large mesh decomposition,” Mat. Model. 23(10), 3-18 (2011).
[19] B. Krasnopolsky, “The reordered BiCGStab method for distributed memory computer systems,” ICCS 2010 Procedia Comput. Sci. 1(1), 213-218 (2010).
[20] Gorobets, A. V.; Sukov, S. A.; Bogdanov, P. B., Toward heterogeneous supercomputing in CFD, 23-34 (2013)
[21] Cuthill, E.; McKee, J., Reducing the bandwidth of sparse symmetric matrices, 157-172 (1969)
[22] Gorobets, A. V.; Sukov, S. A.; Zheleznyakov, A. O.; Bogdanov, P. B.; Chetverushkin, B. N., Application of GPU in hybrid two-level MPI + OpenMP parallelization on heterogeneous computing systems, 76-86 (2011)
[23] A. Monakov, A. Lokhmotov, and A. Avetisyan, “Automatically tuning sparse matrix-vector multiplication for GPU architectures,” High Performance Embedded Architectures and Compilers, Lect. Notes Comput. Sci. 5952, 111-125 (2010).
[24] I. V. Abalakin, P. A. Bakhvalov, A. V. Gorobets, A. P. Duben’, T. K. Kozubskaya, “Parallel research code NOISEtte for large-scale CFD and CAA simulations,” Vychisl. Metody Program. 13, 110-125 (2012).
[25] G. Oyarzun, R. Borrell, A. Gorobets, and A. Oliva, “MPI-CUDA sparse matrix-vector multiplication for the conjugate gradient method with an approximate inverse preconditioner,” Comput. Fluids 92, 244-252 (2014). · Zbl 1390.65038
[26] F. X. Trias, O. Lehmkuhl, A. Oliva, C. D. Pérez-Segarra, R. W. C. P. Verstappen, “Symmetry-preserving discretization of Navier-Stokes equations on collocated unstructured grids,” J. Comput. Phys. 258(1), 246-267 (2014). · Zbl 1349.65386
[27] R. W. C. P. Verstappen and A. E. P. Veldman, “Symmetry-preserving discretization of turbulent flow,” J. Comput. Phys. 187, 343-368 (2003). · Zbl 1062.76542
[28] V. Titarev, M. Dumbser, and S. Utyuzhnikov, “Construction and comparison of parallel implicit kinetic solvers in three spatial dimensions,” J. Comput. Phys. 256, 17-33 (2014). · Zbl 1349.76741
[29] A Guide to Vectorization with Intel C++ Compilers, 2012 (http://download-software.intel.com/sites/default/files/8c/a9/CompilerAutovectorizationGuide.pdf).
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.