A Template for Scalable Continuum Dynamic Simulations in Multiple GPUs

Becerra-Sagredo, Julián; Mandujano, Francisco; Málaga, Carlos; Klapp, Jaime; de Teresa, Irene

doi:10.1007/978-3-319-32243-8_33

Julián Becerra-Sagredo¹²,
Francisco Mandujano¹³,
Carlos Málaga¹³,
Jaime Klapp^12,14 &
…
Irene de Teresa¹⁵

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 595))

Included in the following conference series:

International Conference on Supercomputing in Mexico

901 Accesses
1 Citations

Abstract

In this work we present a programming philosophy and a template code for achieving computational scalability when using multiple graphics processing units (GPUs) in the numerical solution of any mathematical system of equations found in continuum dynamic simulations. The programming philosophy exploits the principal characteristics of the GPU hardware, with emphasis in the delivering of threads with massive memory fetches, intense calculations using local registers and limited writes to global memory. The philosophy requires explicit formulas for calculations for which domain decomposition is trivial. The domains are decomposed in regions that use the local central processing unit (CPU) to communicate common interfaces using the message passing interface (MPI). A template code for the heat equation is established and tested for scalability. The novelty is that we show a series of codes, constructed from the basic template, that solve all the basic model equations found in continuum dynamics, and present illustrative results. The model equations are the heat equation, the Poisson equation, the shallow-water equations, the flow in porous media equations and the vorticity equations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Nickolls, J., Dally, W.J.: The GPU computing era. IEEE Micro 30(2), 56–69 (2010)
Article Google Scholar
Keckler, S.W., Dally, W.J., Khailany, B., Garland, M., Glasco, D.: GPUs and the future of parallel computing. IEEE Micro 31(5), 7–17 (2011)
Article Google Scholar
NVIDIA CUDA C Programming Guide, version 7.5. Nvidia (2015)
Google Scholar
Göddeke, D., Strzodka, R., Mohd-Yusof, J., McCormick, P., Buijssen, S.H.M., Grajewski, M., Turek, S.: Exploring weak scalability for FEM calculations on a GPU-enhanced cluster. Parallel Comput. 33(10–11), 685–699 (2007)
Article Google Scholar
Lu, F., Pang, Y., Yin, F.: Scalability analysis of parallel algorithms on GPU clusters. J. Comp. Inf. Syst. 10(14), 5947–5957 (2014)
Google Scholar
Strang, G., Fix, G.: An Analysis of the Finite Element Method. SIAM, Wellesley Cambridge Press, Wellesley (1973)
MATH Google Scholar
Kuzmin, D., Hämäläinen, J.: Finite Element Methods for Computational Fluid Dynamics: A Practical Guide. Computational Science and Engineering. SIAM, Philadelphia (2014)
MATH Google Scholar
Löner, R., Morgan, K., Peraire, J., Zienkiewicz, O.C.: Recent developments in FEM-CFD. In: Fritts, M.J., Crowley, W.P., Trease, H. (eds.) The Free-Lagrange Method. Lecture Notes in Physics, vol. 238, pp. 236–254. Springer, Heidelberg (2005)
Chapter Google Scholar
Yazid, A., Abdelkader, N., Abdelmadjid, H.: A state-of-the-art review of the X-FEM for computational fracture mechanics. Appl. Math. Model. 33(12), 4269–4282 (2009)
Article MathSciNet MATH Google Scholar
Sukumar, N., Malsch, E.A.: Recent advances in the construction of polygonal finite element interpolants. Arch. Comput. Meth. Eng. 13(1), 129–163 (2006)
Article MathSciNet MATH Google Scholar
Belytschko, T., Gracie, R., Ventura, G.: A review of extended/generalized finite element methods for material modeling. Modell. Simul. Mater. Sci. Eng. 17(4), 1–24 (2009)
Article MathSciNet MATH Google Scholar
Schweitzer, M.A.: Generalizations of the finite element method. Cent. Eur. J. Math. 10(1), 3–24 (2012)
Article MathSciNet MATH Google Scholar
Long, Y.Q., Long, Z.F., Cen, S.: Advanced Finite Element Method in Structural Engineering. Springer, Heidelberg (2009)
Book MATH Google Scholar
Saad, Y., Schultz, M.H.: GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear system. SIAM J. Sci. Stat. Comput. 7, 856–869 (1986)
Article MathSciNet MATH Google Scholar
Hamandi, L.: Review of domain-decomposition methods for the implementation of FEM on massively parallel computers. IEEE Antennas Propag. Mag. 37(1), 93–98 (1995)
Article Google Scholar
Kruzel, F., Banaś, K.: Vectorized OpenCL implementation of numerical integration for higher order finite elements. Comp. Math. Appl. 66(10), 2030–2044 (2013)
Article Google Scholar
Khodja, L.Z., Couturier, R., Glersch, A., Bahi, J.M.: Parallel sparse linear solver with GMRES method using minimization techniques of communications for GPU clusters. J. Supercomput. 69(1), 200–224 (2014)
Article Google Scholar
Turek, S., Göddeke, D., Becker, C., Buijssen, S., Wobker, H.: UCHPC - unconventional high performance computing for finite element simulations. In: International Supercomputing Conference, ISC 2008 (2008)
Google Scholar
Płaszewski, P., Macioł, P., Banaś, K.: Finite element numerical integration on GPUs. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds.) PPAM 2009, Part I. LNCS, vol. 6067, pp. 411–420. Springer, Heidelberg (2010)
Chapter Google Scholar
Dick, C., Georgii, J., Westermann, R.: A real-time multigrid hexahedra method for elasticity simulation using CUDA. Simul. Model. Pract. Theory 19(2), 801–816 (2011)
Article Google Scholar
Banaś, K., Plaszewski, P., Maciol, P.: Numerical integration on GPUs for higher order finite elements. Comp. Math. Appl. 67(6), 1319–1344 (2014)
Article MathSciNet Google Scholar
Huthwaite, P.: Accelerated finite element elastodynamic simulations using the GPU. J. Comp. Phys. 257(Part A), 687–707 (2014)
Article MathSciNet Google Scholar
Martínez-Frutos, J., Martínez-Castejón, P.J., Herrero-Pérez, D.: Fine-grained GPU implementation of assembly-free iterative solver for finite element problems. Comput. Struct. 157, 9–18 (2015)
Article Google Scholar
LeVeque, R.J.: Finite Volume Methods for Hyperbolic Problems. Cambridge Texts in Applied Mathematics. Cambridge University Press, Cambridge (2002)
Book MATH Google Scholar
Oñate, E., Owen, R.: Particle-Based Methods: Fundamentals and Applications. Computational Methods in Applied Sciences. Springer, Heidelberg (2011)
Book MATH Google Scholar
Falcone, M., Ferretti, R.: Semi-Lagrangian Approximation Schemes for Linear and Hamilton-Jacobi Equations. Other Titles in Applied Mathematics. SIAM, Philadelphia (2013)
Book MATH Google Scholar
Becerra-Sagredo, J., Málaga, C., Mandujano, F.: Moments Preservind and high-resolution Semi-Lagrangian Advection Scheme (2014). arXiv: 1410.2817
Becerra-Sagredo, J., Málaga, C., Mandujano, F.: A novel and scalable Multigrid algorithm for many-core architectures (2011). arXiv:1108.2045

Download references

Acknowledgements

This work was partially supported by ABACUS, CONACyT grant EDOMEX-2011-C01-165873.

Author information

Authors and Affiliations

“ABACUS” Centro de Matemáticas Aplicadas y Cómputo de Alto Rendimiento, Departamento de Matemáticas, Centro de Investigación y de Estudios Avanzados (CINVESTAV-IPN), Carretera México-Toluca Km 38.5, La Marquesa, 52140, Ocoyoacac, Estado de México, Mexico
Julián Becerra-Sagredo & Jaime Klapp
Departamento de Física, Facultad de Ciencias, UNAM, Ciudad Universitaria, 04510, Mexico, D.F., Mexico
Francisco Mandujano & Carlos Málaga
Departamento de Física, Instituto Nacional de Investigaciones Nucleares, La Marquesa Ocoyoacac s/n, Ocoyoacac, Edo. de México, Mexico
Jaime Klapp
Department of Mathematical Sciences, University of Delaware, Newark, DE, 19716, USA
Irene de Teresa

Authors

Julián Becerra-Sagredo
View author publications
You can also search for this author in PubMed Google Scholar
Francisco Mandujano
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Málaga
View author publications
You can also search for this author in PubMed Google Scholar
Jaime Klapp
View author publications
You can also search for this author in PubMed Google Scholar
Irene de Teresa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Julián Becerra-Sagredo .

Editor information

Editors and Affiliations

ABACUS Centro de Matemáticas Aplicadas, CINVESTAV-IPN, La Marquesa, Mexico
Isidoro Gitler
Instituto Nacional de Investigaciones Nu, La Marquesa, Mexico
Jaime Klapp

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Becerra-Sagredo, J., Mandujano, F., Málaga, C., Klapp, J., de Teresa, I. (2016). A Template for Scalable Continuum Dynamic Simulations in Multiple GPUs. In: Gitler, I., Klapp, J. (eds) High Performance Computer Applications. ISUM 2015. Communications in Computer and Information Science, vol 595. Springer, Cham. https://doi.org/10.1007/978-3-319-32243-8_33

Download citation

DOI: https://doi.org/10.1007/978-3-319-32243-8_33
Published: 08 April 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32242-1
Online ISBN: 978-3-319-32243-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics