Bull, Jonathan; Engblom, Stefan Distributed and adaptive fast multipole method in three dimensions. (English) Zbl 1473.65004 Commun. Comput. Phys. 30, No. 4, 959-984 (2021). MSC: 65B10 65Y05 65Y10 65Y20 68W15
Bosner, Nela Parallel Prony’s method with multivariate matrix pencil approach and its numerical aspects. (English) Zbl 07340706 SIAM J. Matrix Anal. Appl. 42, No. 2, 635-658 (2021). MSC: 65F99 65G99 65Y05 65Y20 65Z05
Foadaddini, Ali; Zolfaghari, Seyed Alireza; Darian, Hossein Mahmoodi; Saadatfar, Hamid An efficient GPU-based fractional-step domain decomposition scheme for the reaction-diffusion equation. (English) Zbl 1476.65229 Comput. Appl. Math. 39, No. 4, Paper No. 305, 35 p. (2020). MSC: 65M55 65M06 65Y05 65Y20
Mena, Hermann; Pfurtscheller, Lena-Maria; Stillfjord, Tony GPU acceleration of splitting schemes applied to differential matrix equations. (English) Zbl 1441.65057 Numer. Algorithms 83, No. 1, 395-419 (2020). Reviewer: Andreas Mang (Houston) MSC: 65L05 65F60 65Y20
Giuliani, Andrew; Krivodonova, Lilia Adaptive mesh refinement on graphics processing units for applications in gas dynamics. (English) Zbl 1451.65139 J. Comput. Phys. 381, 67-90 (2019). MSC: 65M50 65M60 76M10 35L65 65Y20 76N15 65Y05
Pera, Donato; Málaga, Carlos; Simeoni, Chiara; Plaza, Ramón G. On the efficient numerical simulation of heterogeneous anisotropic diffusion models for tumor invasion using GPUs. (English) Zbl 1437.35678 Rend. Mat. Appl., VII. Ser. 40, No. 3-4, 233-255 (2019). MSC: 35Q92 65M06 65Y05 65Y15 65Y20 92C17 92C37 92C50
Cavoretto, Roberto; Schneider, Teseo; Zulian, Patrick OpenCL based parallel algorithm for RBF-PUM interpolation. (English) Zbl 1383.65011 J. Sci. Comput. 74, No. 1, 267-289 (2018). MSC: 65D05 65Y05 65Y20
Hanif, Muhammad Kashif; Zimmermann, Karl-Heinz Accelerating Viterbi algorithm on graphics processing units. (English) Zbl 1430.68434 Computing 99, No. 11, 1105-1123 (2017). MSC: 68W10 65Y05 65Y20 90C39
Chen, Cheng; Fang, Jianbin; Tang, Tao; Yang, Canqun LU factorization on heterogeneous systems: an energy-efficient approach towards high performance. (English) Zbl 1372.65091 Computing 99, No. 8, 791-811 (2017). MSC: 65F05 15A23 65Y05 65Y10 65Y20
Chen, Tianran; Lee, Tsung-Lin; Li, Tien-Yien Mixed cell computation in HOM4ps. (English) Zbl 1365.13047 J. Symb. Comput. 79, Part 3, 516-534 (2017). Reviewer: Thomas Kahle (Magdeburg) MSC: 13P15 68W10 52B55 65Y05 65Y20 05C85 14Q99
Chen, Yuxin; Keyes, David; Law, Kody J. H.; Ltaief, Hatem Accelerated dimension-independent adaptive metropolis. (English) Zbl 1352.65029 SIAM J. Sci. Comput. 38, No. 5, S539-S565 (2016). MSC: 65C60 62F15 65C05 65C40 60J22 65Y20 65Y05
Yonkee, Nathan; Sutherland, James C. PoKiTT: exposing task and data parallelism on heterogeneous architectures for detailed chemical kinetics, transport, and thermodynamics calculations. (English) Zbl 1382.80001 SIAM J. Sci. Comput. 38, No. 5, S264-S281 (2016). Reviewer: Aleksey Syromyasov (Saransk) MSC: 80-04 80A32 68W10 65Y05 65Y10 65Y20 68M20 68U20
Hogg, Jonathan D.; Ovtchinnikov, Evgueni; Scott, Jennifer A. A sparse symmetric indefinite direct solver for GPU architectures. (English) Zbl 1347.65086 ACM Trans. Math. Softw. 42, No. 1, Article No. 1, 25 p. (2016). MSC: 65F50 65Y05 65Y10 65Y20
Bernaschi, Massimo; Bisson, Mauro; Fantozzi, Carlo; Janna, Carlo A factored sparse approximate inverse preconditioned conjugate gradient solver on graphics processing units. (English) Zbl 1336.65036 SIAM J. Sci. Comput. 38, No. 1, C53-C72 (2016). MSC: 65F10 65F08 65F50 65Y05 65Y10 65Y20
Bock, Nicolas; Challacombe, Matt; Kalé, Laxmikant V. Solvers for \(\mathcal{O}(N)\) electronic structure in the strong scaling limit. (English) Zbl 1336.65054 SIAM J. Sci. Comput. 38, No. 1, C1-C21 (2016). MSC: 65F15 65F30 65F35 65F50 65Y05 81V45 81V55
Mironowicz, P.; Dziekonski, A.; Mrozowski, M. A task-scheduling approach for efficient sparse symmetric matrix-vector multiplication on a GPU. (English) Zbl 1343.65046 SIAM J. Sci. Comput. 37, No. 6, C643-C666 (2015). MSC: 65F50 65Y05 65Y10 65Y20
Mo, Tieqiang; Li, Renfa A new memory mapping mechanism for GPGPUs’ stencil computation. (English) Zbl 1321.65135 Computing 97, No. 8, 795-812 (2015). MSC: 65M06 35K05 65M50 65Y20
Malas, T.; Hager, G.; Ltaief, H.; Stengel, H.; Wellein, G.; Keyes, D. Multicore-optimized wavefront diamond blocking for optimizing stencil updates. (English) Zbl 1331.68286 SIAM J. Sci. Comput. 37, No. 4, C439-C464 (2015). MSC: 68W15 65Y05 68M14 68M20 68Q25 68W10
Woźniak, M.; Kuźnik, K.; Paszyński, M.; Calo, V. M.; Pardo, D. Computational cost estimates for parallel shared memory isogeometric multi-frontal solvers. (English) Zbl 1367.65257 Comput. Math. Appl. 67, No. 10, 1864-1883 (2014). MSC: 65Y20 65N30 65Y05
Lopez, M. Graham; Horton, Mitchel D. Batch matrix exponentiation. (English) Zbl 1317.65113 Kindratenko, Volodymyr (ed.), Numerical computations with GPUs. Cham: Springer (ISBN 978-3-319-06547-2/hbk; 978-3-319-06548-9/ebook). 45-67 (2014). MSC: 65F30 65F60 65Y10 65Y20 65D05
Cottet, G.-H.; Etancelin, J.-M.; Perignon, F.; Picard, C. High order semi-Lagrangian particle methods for transport equations: numerical analysis and implementation issues. (English) Zbl 1310.65134 ESAIM, Math. Model. Numer. Anal. 48, No. 4, 1029-1060 (2014). Reviewer: Krzysztof Moszyński (Warszawa) MSC: 65M75 65M12 65Y05 65Y20 35L02
Kriemann, Ronald \(\mathcal H\)-LU factorization on many-core systems. (English) Zbl 1388.65210 Comput. Vis. Sci. 16, No. 3, 105-117 (2013). MSC: 65Y05 65F05 65Y20 68W10 68W40
Panchatcharam, M.; Sundar, S.; Vetrivel, V.; Klar, A.; Tiwari, S. GPU computing for meshfree particle method. (English) Zbl 1463.65452 Int. J. Numer. Anal. Model., Ser. B 4, No. 4, 394-412 (2013). MSC: 65Y05 65Y20 35Q30 76D05
Fischer, Cyril Massive parallel implementation of ODE solvers. (English) Zbl 1340.65135 Chleboun, J. (ed.) et al., Programs and algorithms of numerical mathematics 16. Proceedings of the 16th seminar (PANM), Dolní Maxov, Czech Republic, June 3–8, 2012. Prague: Academy of Sciences of the Czech Republic, Institute of Mathematics (ISBN 978-80-85823-62-2). 75-80 (2013). Reviewer: Jan Přikryl (Praha) MSC: 65L05 65Y05 65Y10 65Y15 65Y20 34-04
Arnold, Mark G. Extending DNA-sticker arithmetic to arbitrary size using staples. (English) Zbl 1409.68096 Soloveichik, David (ed.) et al., DNA computing and molecular programming. 19th international conference, DNA 19, Tempe, AZ, USA, September 22–27, 2013. Proceedings. Berlin: Springer. Lect. Notes Comput. Sci. 8141, 1-15 (2013). MSC: 68Q05 68Q10 68Q25 92D20
Hogg, J. D. A fast dense triangular solve in CUDA. (English) Zbl 1275.65018 SIAM J. Sci. Comput. 35, No. 3, C303-C322 (2013). MSC: 65F05 65F50 65Y05 65Y20 65Y10
Panchatcharam, M.; Sundar, S. Finite pointset method for 2D dam-break problem with GPU-acceleration. (English) Zbl 1282.76091 Int. J. Appl. Math. 25, No. 4, 547-557 (2012). MSC: 76D05 65Y05 65Y20 35Q30
Miszczak, Jarosław Adam Generating and using truly random quantum states in Mathematica. (English) Zbl 1263.81116 Comput. Phys. Commun. 183, No. 1, 118-124 (2012). MSC: 81P68 81P45 65C10 81-08 68N30 68Q12
Luo, Yuancheng; Duraiswami, Ramani Efficient parallel nonnegative least squares on multicore architectures. (English) Zbl 1232.65194 SIAM J. Sci. Comput. 33, No. 5, 2848-2863 (2011). MSC: 65Y05 65Y20 15A06 15A23
Loke, T.; Wang, J. B. An efficient quantum circuit analyser on qubits and qudits. (English) Zbl 1223.81081 Comput. Phys. Commun. 182, No. 10, 2285-2294 (2011). MSC: 81P68 68Q12 81T80
Appleyard, J.; Drikakis, D. Higher-order CFD and interface tracking methods on highly-Parallel MPI and GPU systems. (English) Zbl 1431.76005 Comput. Fluids 46, No. 1, 101-105 (2011). MSC: 76-06 76M99 65Y05 65Y20
Dubois, Jérôme; Calvin, Christophe; Petiton, Serge Performance and numerical accuracy evaluation of heterogeneous multicore systems for Krylov orthogonal basis computation. (English) Zbl 1323.65145 Palma, José M. Laginha M. (ed.) et al., High performance computing for computational science – VECPAR 2010. 9th international conference, Berkeley, CA, USA, June 22–25, 2010. Revised selected papers. Berlin: Springer (ISBN 978-3-642-19327-9/pbk). Lecture Notes in Computer Science 6449, 45-57 (2011). MSC: 65Y20 65F25 65Y05 65Y10
Rostrup, Scott; De Sterck, Hans Parallel hyperbolic PDE simulation on clusters: Cell versus GPU. (English) Zbl 1432.35002 Comput. Phys. Commun. 181, No. 12, 2164-2179 (2010). MSC: 35-04 35L99 65Y05 65Y20
Bernstein, Daniel J.; Chen, Hsieh-Chung; Cheng, Chen-Mou; Lange, Tanja; Niederhagen, Ruben; Schwabe, Peter; Yang, Bo-Yin ECC2K-130 on NVIDIA GPUs. (English) Zbl 1248.94054 Gong, Guang (ed.) et al., Progress in cryptology – INDOCRYPT 2010. 11th international conference on cryptology in India, Hyderabad, India, December 12–15, 2010. Proceedings. Berlin: Springer (ISBN 978-3-642-17400-1/pbk). Lecture Notes in Computer Science 6498, 328-346 (2010). MSC: 94A60 11Y16 14G50 68W30
Klöckner, A.; Warburton, T.; Bridge, J.; Hesthaven, J. S. Nodal discontinuous Galerkin methods on graphics processors. (English) Zbl 1175.65111 J. Comput. Phys. 228, No. 21, 7863-7882 (2009). MSC: 65M60 35L65 65Y20 65Y05 35Q61
Gumerov, Nail A.; Duraiswami, Ramani Fast multipole methods on graphics processors. (English) Zbl 1147.65012 J. Comput. Phys. 227, No. 18, 8290-8313 (2008). MSC: 65D15 65B10 65N38 65Y05 65Y15 65Y20