Skip to main content
Log in

High Performance Computing on Heterogeneous Clusters with the Madeleine II Communication Library

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

This paper introduces Madeleine II, a new adaptive and portable multi-protocol implementation of the Madeleine communication library. Madeleine II has the ability to control multiple network interfaces (BIP, SISCI, VIA) and multiple network adapters (Ethernet, Myrinet, SCI) within the same application session. We report on performance measurements obtained using BIP/Myrinet and SISCI/SCI and we present preliminary results about our MPICH/Madeleine II and Nexus/Madeleine II ports. We also discuss an extension of Madeleine II for clusters of clusters which is able to handle heterogeneous networks. In particular, we present the fast internal data-forwarding mechanism that is used on gateway nodes to speed up inter-cluster transmissions. Preliminary experiments show that the resulting inter-cluster bandwidth is close to the one delivered by the hardware.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. I. Foster, C. Kesselman and S. Tuecke, The Nexus approach to integrating multithreading and communications, Journal on Parallel and Distributed Computing 37(1) (1996) 70–82.

    Google Scholar 

  2. R. Namyst and J.-F. Méhaut, PM2: Parallel Multithreaded Machine; a computing environment for distributed architectures, in: Parallel Computing (ParCo'95) (Elsevier, Amsterdam, 1995) pp. 279–285.

    Google Scholar 

  3. L. Bougé, J.-F. Méhaut and R.-Namyst, Efficient communications in multithreaded runtime systems, in: Parallel and Distributed Processing. Proc. 3rd Workshop on Runtime Systems for Parallel Programming (RTSPP'99), Lecture Notes in Computer Science, Vol. 1586 (Springer, San Juan, Puerto Rico, 1999) pp. 468–482.

    Google Scholar 

  4. J. Worringen and T. Bemmerl, MPICH for SCI-connected clusters, in: SCI Europe'99 (Bordeaux, France, 1999) pp. 3–11.

    Google Scholar 

  5. D. Dunning, G. Regnier, G. McAlpine, D. Cameron, B. Shubert, F. Berry, A.-M. Merritt, E. Gronke and C. Dodd, The virtual interface architecture, IEEE Micro (March 1998) 66–75.

  6. E. Lusk and W. Gropp. MPICH working note: The second-generation ADI for the MPICH implementation ofMPI, Technical report, Argonne National Laboratory (1996).

  7. S. Pakin, V. Karamcheti and A. Chien, Fast Messages: Efficient, portable communication for workstation clusters and MPPs, IEEE Concurrency 5(2) (1997) 60–73.

    Google Scholar 

  8. H. Hellwagner and A. Reinefeld, eds., SCI: Scalable Coherent Interface, Architecture and Software for High-Performance Compute Clusters, Lecture Notes in Computer Science, State-of-the-Art Surveys, Vol. 1734 (Springer, Amsterdam, 1999).

    Google Scholar 

  9. L. Prylli and B. Tourancheau, BIP: a new protocol designed for high performance networking on Myrinet, in: 1st Workshop on Personal Computer Based Networks Of Workstations (PC-NOW'98), Lecture Notes in Computer Science, Vol. 1388 (Springer, Amsterdam, 1998) pp. 472–485.

    Google Scholar 

  10. G. Mercier, Support efficace de l'hétérogénéité des réseaux dans MPI, Master's thesis report, DEA d'Informatique Fondamentale, ENS Lyon, France (June 2000).

    Google Scholar 

  11. A. Denis, Adaptation de l'environnemnet générique de metacomputing Globus à des réseaux haut débit, Master's thesis report, DEA d'Informatique Fondamentale, ENS Lyon, France (June 2000).

    Google Scholar 

  12. Sca-MPI, http://www.scali.com/.

  13. I. Foster, J. Geisler, C. Kesselman and S. Tuecke, Managing multiple communication methods in high-performance networked computing systems, Journal on Parallel and Distributed Computing 40 (1997) 35–48.

    Google Scholar 

  14. PACX-MPI, http://www.hlrs.de/structure/ organisation/par/projects/pacx-mpi/.

  15. R. Russell and P. Hatcher, Efficient kernel support for reliable communication, in: 13th ACM Symposium on Applied Computing (Atlanta, GA, 1998) pp. 541–550.

  16. V. Danjean, R. Namyst and R. Russell, Integrating kernel activations in a multithreaded runtime system on Linux, in: Proc. 4th Workshop on Runtime Systems for Parallel Processing (RTSPP'00), Lecture Notes in Computer Science, Vol. 1800 (Springer, Cancun, Mexico, 2000) pp. 1160–1167.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Aumage, O., Bougé, L., Eyraud, L. et al. High Performance Computing on Heterogeneous Clusters with the Madeleine II Communication Library. Cluster Computing 5, 43–54 (2002). https://doi.org/10.1023/A:1012792605138

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1012792605138

Navigation