Abstract
Simulation software on today’s HPC systems needs to be scalable to a large number of processes to make efficient use of such machines. Already at this level and especially with the expected increasing scalability of upcoming machines, the development and debugging of parallel programs becomes an increasingly difficult task. Consequently, sophisticated tools providing mechanisms for handling large-scale parallel and distributed programs are needed. In this paper we show several ways to improve the handling of large event traces using the Trace Viewer plug-in of the g-Eclipse tool and we propose the use of a pattern matching technique to simplify the debugging of large message passing parallel programs. With the pattern matching approach, we enable an additional layer of abstraction, which supports the user in understanding the program’s behaviour.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Christof Klausecker, Thomas Köckerbauer, Robert Preissl, and Dieter Kranzlmüller. Debugging MPI programs on the grid using g-Eclipse. In Michael Resch, Rainer Keller, Valentin Himmler, Bettina Krammer, and Alexander Schulz, editors, Tools for High Performance Computing, Proceedings of the 2nd International Workshop on Parallel Tools for High Performance Computing, pages 35–45, Stuttgart, July 2008. HLRS, Springer-Verlag.
Andreas Knüpfer, Ronny Brendel, Holger Brunst, Hartmut Mix, and Wolfgang E. Nagel. Introducing the open trace format (OTF). In ICCS 2006, volume 3992/2006 of Lecture Notes in Computer Science, pages 526–533. Springer Berlin / Heidelberg, 2006.
Harald Kornmayer, Mathias Stümpert, Markus Knauer, and Pawel Wolniewicz. g-Eclipse - an integrated workbench tool for grid application users, grid operators and grid application developers. In Cracow Grid Workshop ’06, Cracow, Poland, October 2006.
Bettina Krammer, Katrin Bidmon, Matthias S. Müller, and Michael M. Resch. MARMOT: An MPI analysis and checking tool. In ParCo, pages 493–500, 2003.
Dieter Kranzlmüller. Event Graph Analysis for Debugging Massively Parallel Programs. PhD thesis, Johannes Kepler University Linz, September 2000.
Dieter Kranzlmüller, Siegfried Grabner, and Jens Volkert. Event graph visualization for debugging large applications. In SPDT ’96: Proceedings of the SIGMETRICS symposium on Parallel and distributed tools, pages 108–117, New York, NY, USA, 1996. ACM.
Dieter Kranzlmüller, Michael Scarpa, and Jens Volkert. DeWiz - a modular tool architecture for parallel program analysis. In Euro-Par, pages 74–80, 2003.
Dieter Kranzlmüller and Jens Volkert. NOPE: A nondeterministic program evaluator. In ParNum ’99: Proceedings of the 4th International ACPC Conference Including Special Tracks on Parallel Numerics and Parallel Computing in Image Processing, Video Processing, and Multimedia, pages 490–499, London, UK, 1999. Springer-Verlag.
Chao Ma, Yong Meng Teo, Verdi March, Naixue Xiong, Ioana Romelia Pop, Yan Xiang He, and Simon See. An approach for matching communication patterns in parallel applications. In IPDPS 2009, 2009.
Beniamino Di Martino, Antonino Mazzeo, Nicola Mazzocca, and Umberto Villano. Parallel program analysis and restructuring by detection of point-to-point interaction patterns and their transformation into collective communication constructs. Science of Computer Programming, 40(2-3):235–261, 2001.
Wolfgang E. Nagel, Alfred Arnold, Michael Weber, Hans-Christian Hoppe, and Karl Solchenbach. VAMPIR: Visualization and analysis of MPI resources. Supercomputer, 12(1):69–80, Jan 1996.
Robert Preissl, Thomas Köckerbauer, Martin Schulz, Dieter Kranzlmüller, Bronis R. de Supinski, and Daniel J. Quinlan. Detecting patterns in MPI communication traces. International Conference on Parallel Processing, 0:230–237, 2008.
Robert Preissl, Martin Schulz, Dieter Kranzlmüller, Bronis R. de Supinski, and Daniel J. Quinlan. Using MPI communication patterns to guide source code transformations. In Marian Bubak, G. Dick van Albada, Jack Dongarra, and Peter M. A. Sloot, editors, ICCS (3), volume 5103 of Lecture Notes in Computer Science, pages 253–260. Springer, 2008.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Köckerbauer, T., Klausecker, C., Kranzlmüller, D. (2010). Scalable Parallel Debugging with g-Eclipse. In: Müller, M., Resch, M., Schulz, A., Nagel, W. (eds) Tools for High Performance Computing 2009. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11261-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-642-11261-4_8
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11260-7
Online ISBN: 978-3-642-11261-4
eBook Packages: Computer ScienceComputer Science (R0)