skip to main content
10.1145/3587135.3592174acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
invited-talk

Adaptive multi-tier intelligent data manager for Exascale

Authors Info & Claims
Published:04 August 2023Publication History

ABSTRACT

The main objective of the ADMIRE project1 is the creation of an active I/O stack that dynamically adjusts computation and storage requirements through intelligent global coordination, the elasticity of computation and I/O, and the scheduling of storage resources along all levels of the storage hierarchy, while offering quality-of-service (QoS), energy efficiency, and resilience for accessing extremely large data sets in very heterogeneous computing and storage environments. We have developed a framework prototype that is able to dynamically adjust computation and storage requirements through intelligent global coordination, separated control, and data paths, the malleability of computation and I/O, the scheduling of storage resources along all levels of the storage hierarchy, and scalable monitoring techniques. The leading idea in ADMIRE is to co-design applications with ad-hoc storage systems that can be deployed with the application and adapt their computing and I/O behaviour on runtime, using malleability techniques, to increase the performance of applications and the throughput of the applications.

References

  1. Jean Luca Bez, Alberto Miranda, Ramon Nou, Francieli Zanon Boito, Toni Cortes, and Philippe O. A. Navaux. 2021. Arbitration Policies for On-Demand User-Level I/O Forwarding on HPC Platforms. In 35th IEEE International Parallel and Distributed Processing Symposium, IPDPS 2021, Portland, OR, USA, May 17-21, 2021. IEEE, 577--586.Google ScholarGoogle ScholarCross RefCross Ref
  2. André Brinkmann, Kathryn Mohror, Weikuan Yu, Philip H. Carns, Toni Cortes, Scott Klasky, Alberto Miranda, Franz-Josef Pfreundt, Robert B. Ross, and Marc-Andre Vef. 2020. Ad Hoc File Systems for High-Performance Computing. J. Comput. Sci. Technol. 35, 1 (2020), 4--26.Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Félix Garcia-Carballeira, Alejandro Calderon, Jesus Carretero, Javier Fernandez, and Jose M Perez. 2003. The design of the Expand parallel file system. The International Journal of High Performance Computing Applications 17, 1 (2003), 21--37.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Florin Isaila, Javier Garcia-Blas, Jesus Carretero, Rob Ross, and Dries Kimpe. 2017. Making the case for reforming the I/O software stack of extreme-scale systems. Advances in Engineering Software 111 (2017), 26--31. Advances in High Performance Computing: on the path to Exascale software.Google ScholarGoogle ScholarCross RefCross Ref
  5. Raffaele Montella, Diana Di Luccio, Pasquale Troiano, Angelo Riccio, Alison Brizius, and Ian Foster. 2016. WaComM: A parallel Water quality Community Model for pollutant transport and dispersion operational predictions. In 2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS). IEEE, 717--724.Google ScholarGoogle ScholarCross RefCross Ref
  6. Francisco José Rodrigo Duro, Fabrizio Marozzo, Javier García Blas, Jesús Carretero Pérez, Domenico Talia, and Paolo Trunfio. 2015. Evaluating data caching techniques in DMCF workflows using Hercules. (2015).Google ScholarGoogle Scholar
  7. M. Vef, N. Moti, T. Süß, T. Tocci, R. Nou, A. Miranda, T. Cortes, and A. Brinkmann. 2018. GekkoFS - A Temporary Distributed File System for HPC Applications. In 2018 IEEE International Conference on Cluster Computing. 319--324.Google ScholarGoogle Scholar
  8. Chen Wang. 2022. Detecting Data Races on Relaxed Systems Using Recorder.Google ScholarGoogle Scholar
  9. Chen Wang, Kathryn Mohror, and Marc Snir. 2021. File System Semantics Requirements of HPC Applications. In HPDC '21: The 30th International Symposium on High-Performance Parallel and Distributed Computing, Virtual Event. ACM.Google ScholarGoogle Scholar

Index Terms

  1. Adaptive multi-tier intelligent data manager for Exascale

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CF '23: Proceedings of the 20th ACM International Conference on Computing Frontiers
          May 2023
          419 pages
          ISBN:9798400701405
          DOI:10.1145/3587135

          Copyright © 2023 Owner/Author

          Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 4 August 2023

          Check for updates

          Qualifiers

          • invited-talk
          • Research
          • Refereed limited

          Acceptance Rates

          CF '23 Paper Acceptance Rate24of66submissions,36%Overall Acceptance Rate240of680submissions,35%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader