Skip to main content

A Template for Scalable Continuum Dynamic Simulations in Multiple GPUs

  • Conference paper
  • First Online:
High Performance Computer Applications (ISUM 2015)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 595))

Included in the following conference series:

Abstract

In this work we present a programming philosophy and a template code for achieving computational scalability when using multiple graphics processing units (GPUs) in the numerical solution of any mathematical system of equations found in continuum dynamic simulations. The programming philosophy exploits the principal characteristics of the GPU hardware, with emphasis in the delivering of threads with massive memory fetches, intense calculations using local registers and limited writes to global memory. The philosophy requires explicit formulas for calculations for which domain decomposition is trivial. The domains are decomposed in regions that use the local central processing unit (CPU) to communicate common interfaces using the message passing interface (MPI). A template code for the heat equation is established and tested for scalability. The novelty is that we show a series of codes, constructed from the basic template, that solve all the basic model equations found in continuum dynamics, and present illustrative results. The model equations are the heat equation, the Poisson equation, the shallow-water equations, the flow in porous media equations and the vorticity equations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Nickolls, J., Dally, W.J.: The GPU computing era. IEEE Micro 30(2), 56–69 (2010)

    Article  Google Scholar 

  2. Keckler, S.W., Dally, W.J., Khailany, B., Garland, M., Glasco, D.: GPUs and the future of parallel computing. IEEE Micro 31(5), 7–17 (2011)

    Article  Google Scholar 

  3. NVIDIA CUDA C Programming Guide, version 7.5. Nvidia (2015)

    Google Scholar 

  4. Göddeke, D., Strzodka, R., Mohd-Yusof, J., McCormick, P., Buijssen, S.H.M., Grajewski, M., Turek, S.: Exploring weak scalability for FEM calculations on a GPU-enhanced cluster. Parallel Comput. 33(10–11), 685–699 (2007)

    Article  Google Scholar 

  5. Lu, F., Pang, Y., Yin, F.: Scalability analysis of parallel algorithms on GPU clusters. J. Comp. Inf. Syst. 10(14), 5947–5957 (2014)

    Google Scholar 

  6. Strang, G., Fix, G.: An Analysis of the Finite Element Method. SIAM, Wellesley Cambridge Press, Wellesley (1973)

    MATH  Google Scholar 

  7. Kuzmin, D., Hämäläinen, J.: Finite Element Methods for Computational Fluid Dynamics: A Practical Guide. Computational Science and Engineering. SIAM, Philadelphia (2014)

    MATH  Google Scholar 

  8. Löner, R., Morgan, K., Peraire, J., Zienkiewicz, O.C.: Recent developments in FEM-CFD. In: Fritts, M.J., Crowley, W.P., Trease, H. (eds.) The Free-Lagrange Method. Lecture Notes in Physics, vol. 238, pp. 236–254. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  9. Yazid, A., Abdelkader, N., Abdelmadjid, H.: A state-of-the-art review of the X-FEM for computational fracture mechanics. Appl. Math. Model. 33(12), 4269–4282 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  10. Sukumar, N., Malsch, E.A.: Recent advances in the construction of polygonal finite element interpolants. Arch. Comput. Meth. Eng. 13(1), 129–163 (2006)

    Article  MathSciNet  MATH  Google Scholar 

  11. Belytschko, T., Gracie, R., Ventura, G.: A review of extended/generalized finite element methods for material modeling. Modell. Simul. Mater. Sci. Eng. 17(4), 1–24 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  12. Schweitzer, M.A.: Generalizations of the finite element method. Cent. Eur. J. Math. 10(1), 3–24 (2012)

    Article  MathSciNet  MATH  Google Scholar 

  13. Long, Y.Q., Long, Z.F., Cen, S.: Advanced Finite Element Method in Structural Engineering. Springer, Heidelberg (2009)

    Book  MATH  Google Scholar 

  14. Saad, Y., Schultz, M.H.: GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear system. SIAM J. Sci. Stat. Comput. 7, 856–869 (1986)

    Article  MathSciNet  MATH  Google Scholar 

  15. Hamandi, L.: Review of domain-decomposition methods for the implementation of FEM on massively parallel computers. IEEE Antennas Propag. Mag. 37(1), 93–98 (1995)

    Article  Google Scholar 

  16. Kruzel, F., Banaś, K.: Vectorized OpenCL implementation of numerical integration for higher order finite elements. Comp. Math. Appl. 66(10), 2030–2044 (2013)

    Article  Google Scholar 

  17. Khodja, L.Z., Couturier, R., Glersch, A., Bahi, J.M.: Parallel sparse linear solver with GMRES method using minimization techniques of communications for GPU clusters. J. Supercomput. 69(1), 200–224 (2014)

    Article  Google Scholar 

  18. Turek, S., Göddeke, D., Becker, C., Buijssen, S., Wobker, H.: UCHPC - unconventional high performance computing for finite element simulations. In: International Supercomputing Conference, ISC 2008 (2008)

    Google Scholar 

  19. Płaszewski, P., Macioł, P., Banaś, K.: Finite element numerical integration on GPUs. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Wasniewski, J. (eds.) PPAM 2009, Part I. LNCS, vol. 6067, pp. 411–420. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  20. Dick, C., Georgii, J., Westermann, R.: A real-time multigrid hexahedra method for elasticity simulation using CUDA. Simul. Model. Pract. Theory 19(2), 801–816 (2011)

    Article  Google Scholar 

  21. Banaś, K., Plaszewski, P., Maciol, P.: Numerical integration on GPUs for higher order finite elements. Comp. Math. Appl. 67(6), 1319–1344 (2014)

    Article  MathSciNet  Google Scholar 

  22. Huthwaite, P.: Accelerated finite element elastodynamic simulations using the GPU. J. Comp. Phys. 257(Part A), 687–707 (2014)

    Article  MathSciNet  Google Scholar 

  23. Martínez-Frutos, J., Martínez-Castejón, P.J., Herrero-Pérez, D.: Fine-grained GPU implementation of assembly-free iterative solver for finite element problems. Comput. Struct. 157, 9–18 (2015)

    Article  Google Scholar 

  24. LeVeque, R.J.: Finite Volume Methods for Hyperbolic Problems. Cambridge Texts in Applied Mathematics. Cambridge University Press, Cambridge (2002)

    Book  MATH  Google Scholar 

  25. Oñate, E., Owen, R.: Particle-Based Methods: Fundamentals and Applications. Computational Methods in Applied Sciences. Springer, Heidelberg (2011)

    Book  MATH  Google Scholar 

  26. Falcone, M., Ferretti, R.: Semi-Lagrangian Approximation Schemes for Linear and Hamilton-Jacobi Equations. Other Titles in Applied Mathematics. SIAM, Philadelphia (2013)

    Book  MATH  Google Scholar 

  27. Becerra-Sagredo, J., Málaga, C., Mandujano, F.: Moments Preservind and high-resolution Semi-Lagrangian Advection Scheme (2014). arXiv: 1410.2817

  28. Becerra-Sagredo, J., Málaga, C., Mandujano, F.: A novel and scalable Multigrid algorithm for many-core architectures (2011). arXiv:1108.2045

Download references

Acknowledgements

This work was partially supported by ABACUS, CONACyT grant EDOMEX-2011-C01-165873.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Julián Becerra-Sagredo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Becerra-Sagredo, J., Mandujano, F., Málaga, C., Klapp, J., de Teresa, I. (2016). A Template for Scalable Continuum Dynamic Simulations in Multiple GPUs. In: Gitler, I., Klapp, J. (eds) High Performance Computer Applications. ISUM 2015. Communications in Computer and Information Science, vol 595. Springer, Cham. https://doi.org/10.1007/978-3-319-32243-8_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-32243-8_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-32242-1

  • Online ISBN: 978-3-319-32243-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics