skip to main content
10.1145/3183767.3183769acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesparma-ditamConference Proceedingsconference-collections
research-article

Managing Heterogeneous Resources in HPC Systems

Authors Info & Claims
Published:23 January 2018Publication History

ABSTRACT

To sustain performance while facing always tighter power and energy envelopes, High Performance Computing (HPC) is increasingly leveraging heterogeneous architectures. This poses new challenges: to efficiently exploit the available resources, both in terms of hardware and energy, resource management must support a wide range of different heterogeneous devices and programming models that target different application domains. We present a strategy for resource management and programming model support for heterogeneous accelerators for HPC systems with requirements targeting performance, power and predictability. We show how resource management can, in addition to allowing multiple applications to share a set of resources, reduce the burden on the application developer and improve the efficiency of resource allocation.

References

  1. ARB 2008. OpenMP Application Program Interface, version 3.0. ARB. http://www.openmp.orgGoogle ScholarGoogle Scholar
  2. Cédric Augonnet, Samuel Thibault, Raymond Namyst, and Pierre-André Wacrenier. 2011. StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. Concurr. Comput.: Pract. Exper. 23, 2 (Feb. 2011), 187--198. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Patrick Bellasi, Giuseppe Massari, and William Fornaciari. 2015. Effective Runtime Resource Management Using Linux Control Groups with the BarbequeRTRM Framework. ACM Trans. Embed. Comput. Syst. 14, 2, Article 39 (March 2015), 17 pages. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. José Flich, Giovanni Agosta, Philipp Ampletzer, David Atienza Alonso, Carlo Brandolese, Etienne Cappe, Alessandro Cilardo, Leon Dragić, Alexandre Dray, Alen Duspara, et al. 2017. MANGO: Exploring Manycore Architectures for Next-GeneratiOn HPC Systems. In 2017 Euromicro Conference on Digital System Design (DSD). 478--485.Google ScholarGoogle Scholar
  5. Jose Flich, Giovanni Agosta, Philipp Ampletzer, David Atienza Alonso, Alessandro Cilardo, William Fornaciari, Mario Kovac, Fabrice Roudet, and Davide Zoni. 2015. The MANGO FET-HPC Project: An Overview. In Computational Science and Engineering (CSE), 2015 IEEE 18th International Conference on. IEEE, 351--354. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Morris Jette and Mark Grondona. 2003. SLURM: Simple Linux Utility for Resource Management. In ClusterWorld Conference and Expo.Google ScholarGoogle Scholar
  7. Khronos OpenCL Working Group. 2014. The OpenCL Specification, Version 1.2. https://www.khronos.org/registry/cl/specs/opencl-1.2.pdf. Aaftab Munshi eds.Google ScholarGoogle Scholar
  8. Khronos OpenCL Working Group -- SYCL subgroup. 2014. SYCL™ Specification, Version 1.2. https://www.khronos.org/registry/sycl/specs/sycl-1.2.pdf. Lee Howes and Maria Rovatsou eds.Google ScholarGoogle Scholar
  9. Bastian Koller, Nico Struckmann, Jochen Buchholz, and Michael Gienger. 2015. Towards an Environment to Deliver High Performance Computing to Small and Medium Enterprises. Springer International Publishing, Cham, 41--50.Google ScholarGoogle Scholar
  10. G. Massari, E. Paone, P. Bellasi, G. Palermo, V. Zaccaria, W. Fornaciari, and C. Silvano. 2014. Combining application adaptivity and system-wide Resource Management on multi-core platforms. In 2014 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XIV). 26--33.Google ScholarGoogle Scholar
  11. Microsoft Corporation. 2013. C++ AMP: C++ Accelerated Massive Parallelism, Version 1.2. http://download.microsoft.com/download/4/0/E/40EA02D8-23A7-4BD2-AD3A-0BFFFB640F28/CppAMPLanguageAndProgrammingModel.pdf.Google ScholarGoogle Scholar
  12. John Nickolls, Ian Buck, Michael Garland, and Kevin Skadron. 2008. Scalable Parallel Programming with CUDA. ACM Queue 6, 2 (2008), 40--53. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. nVidia Corp. 2008. CUDA Technology. http://www.nvidia.com/CUDA. (September 2008).Google ScholarGoogle Scholar
  14. A. Pupykina and G. Agosta. 2017. Optimizing Memory Management in Deeply Heterogeneous HPC Accelerators. In 2017 46th International Conference on Parallel Processing Workshops (ICPPW). 291--300.Google ScholarGoogle Scholar
  15. Ehsan Totoni, Babak Behzad, Swapnil Ghike, and Josep Torrellas. 2012. Comparing the Power and Performance of Intel's SCC to State-of-the-art CPUs and GPUs. In Proceedings of the 2012 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS '12). IEEE Computer Society, Washington, DC, USA, 78--87. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Sandra Wienke, Paul Springer, Christian Terboven, and Dieter an Mey. 2012. OpenACC: First Experiences with Real-world Applications. In Proceedings of the 18th International Conference on Parallel Processing (Euro-Par' 12). Springer-Verlag, Berlin, Heidelberg, 859--870. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Managing Heterogeneous Resources in HPC Systems

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            PARMA-DITAM '18: Proceedings of the 9th Workshop and 7th Workshop on Parallel Programming and RunTime Management Techniques for Manycore Architectures and Design Tools and Architectures for Multicore Embedded Computing Platforms
            January 2018
            76 pages
            ISBN:9781450364447
            DOI:10.1145/3183767

            Copyright © 2018 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 23 January 2018

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

            Acceptance Rates

            Overall Acceptance Rate11of24submissions,46%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader