Compositional Approach Applied to Loop Specialization

Djoudi, Lamia; Acquaviva, Jean-Thomas; Barthou, Denis

doi:10.1007/978-3-540-74466-5_30

Lamia Djoudi¹,
Jean-Thomas Acquaviva¹ &
Denis Barthou¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4641))

Included in the following conference series:

European Conference on Parallel Processing

750 Accesses
1 Citations

Abstract

An optimizing compiler has a hard time to generate a code which will perform at top speed for an arbitrary data set size. In general, the low level optimization process must take into account parameters such as loop trip count for generating efficient code. The code can be specialized depending upon data set size ranges, at the expense of code expansion and decision tree overhead.

Download to read the full chapter text

Chapter PDF

Distributing and Parallelizing Non-canonical Loops

Pure Functions in C: A Small Keyword for Automatic Parallelization

Article Open access 30 May 2020

An Effective Framework of Program Optimization for High Performance Computing

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Schwiegelshohn, U., Gasperoni, F., Ebcioglu, K.: On Optimal Parallelization of Arbitrary Loops. Journal of Parallel and Distributed Computing 11, 130–134 (1991)
Article Google Scholar
Darte, A., Robert, Y.: Affine-by-statement scheduling of uniform and affine loop nests over parametric domains. Journal of Parallel and Distributed Computing 29(1), 43–59 (1995)
Article Google Scholar
Rau, B.R.: Iterative modulo scheduling: an algorithm for software pipelining loops. In: Int. Symp. on Microarchitecture, San Jose, California, United States, pp. 63–74. ACM Press, New York (1994)
Google Scholar
Griebl, M., Feautrier, P., Lengauer, C.: Index set splitting. Int. Journal of Parallel Programming 28(6), 607–631 (2000)
Article Google Scholar
Djoudi, L., Barthou, D., Carribault, P., Lemuet, C., Acquaviva, J.T., Jalby, W.: Exploring application performance: a new tool for a static / dynamic approach. In: LACSI Los Alamos Computer Science Institute Symposium (2005)
Google Scholar
Cooper, K., Dasgupta, A., Kennedy, K.: Vizer: A system to vectorize intel x86 binaries. In: LACSI Los Alamos Computer Science Institute Symposium (December 2002)
Google Scholar
Merten, M., Thiems, M.: An overview of the IMPACT x86 binary reoptimization framework. Technical report (July 1998)
Google Scholar
Larus, J., Schnarr, E.: EEL: Machine-independent executable editing. In: Int. Conf. on Programming Language Design and Implementation, pp. 291–300 (1995)
Google Scholar
McNairy, C., Soltis, D.: Itanium 2 processor microarchitecture. IEEE Micro 23(2), 44–55 (2003)
Article Google Scholar
Allan, V.H., Jones, R.B., Lee, R.M., Allan, S.J.: Software pipelining. ACM Computing Surveys 27(3), 367–432 (1995)
Article Google Scholar
Doshi, G., Krishnaiyer, R., Muthukumar, K.: Optimizing software data prefetches with rotating registers. In: Int. Conf. on Parallel Architectures and Compilation Techniques, Barcelona, Catalunya, Spain, IEEE Computer Society Press, Los Alamitos (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Université de Versailles, France
Lamia Djoudi, Jean-Thomas Acquaviva & Denis Barthou

Authors

Lamia Djoudi
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Thomas Acquaviva
View author publications
You can also search for this author in PubMed Google Scholar
Denis Barthou
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Anne-Marie Kermarrec Luc Bougé Thierry Priol

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Djoudi, L., Acquaviva, JT., Barthou, D. (2007). Compositional Approach Applied to Loop Specialization. In: Kermarrec, AM., Bougé, L., Priol, T. (eds) Euro-Par 2007 Parallel Processing. Euro-Par 2007. Lecture Notes in Computer Science, vol 4641. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74466-5_30

Download citation

DOI: https://doi.org/10.1007/978-3-540-74466-5_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74465-8
Online ISBN: 978-3-540-74466-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Compositional Approach Applied to Loop Specialization

Abstract

Chapter PDF

Similar content being viewed by others

Distributing and Parallelizing Non-canonical Loops

Pure Functions in C: A Small Keyword for Automatic Parallelization

An Effective Framework of Program Optimization for High Performance Computing

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Compositional Approach Applied to Loop Specialization

Abstract

Chapter PDF

Similar content being viewed by others

Distributing and Parallelizing Non-canonical Loops

Pure Functions in C: A Small Keyword for Automatic Parallelization

An Effective Framework of Program Optimization for High Performance Computing

Keywords

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation