Abstract
In this chapter, we propose a linear data distribution technique, which extends the traditional BLOCK or CYCLIC distribution for intra-dimension as in HPF, to permit partitioning the array elements along slant lines. The array distribution patterns are determined by analyzing the array subscript references in loop nests. If the data are distributed along the slant lines, then we show the conversion algorithm between global address and local address, and the conversion algorithm from global iteration space to local iteration space.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adve, V. S., Mellor-Crummey, J., Anderson, M., Kennedy, K., Wang, J.-C., and Reed, D. A.: An Integrated Compilation and Performance Analysis Environment for Data Parallel Programs, Proceedings of Supercomputing’95, San Diego, CA, Dec. 1995.
Amarasinghe, S. P. and Lam, M. S.: Communication Optimization and Code Generation for Distributed Memory Machines, Proceedings of the ACM SIGPLAN’93 Conference on Programming Language Design and Implementation, Albuquerque, NM, June 1993.
Anderson, J. M. and Lam, M. S.: Global Optimizations for Parallelism and Locality on Scalable Parallel Machines, Proc. of the SIGPLAN ‘83 Conference on Program Language Design and Implementation, ACM, 1993, pp. 112–125.
Balasundaram, V., Fox, G., Kennedy, K., and Kremer, U.: An Interactive Environment for Data Partitioning and distribution, Proc. Fifth Distributed Memory Computing Conference, 1990.
Balasundaram, V., Fox, G., Kennedy, K., and Kremer, U.: An Static Performance Estimator to Guide Data Partitioning Decisions, Proceedings of the Third ACMSIGPLANSymposium on Principles and Practice of Parallel Programming, Williamsburg, VA, Apr. 1991.
Banerjee, P., Chandy, J. A., Gupta, M., Hodges IV, E., Holm, J., Lain, A., Palermo, D., Ramaswamy, S., and Su, E.: The PARADIGM Compiler for Distributed-memory Multicomputers, IEEE Comput, Vol. 28 (1995), pp. 37–47.
Banerjee, U.: Loop Parallelization, A Book Series on Loop Transformations for Restructuring Compilers, Kluwer Academic Publishers, 1994.
Bau, D., Koduklula, I., Kotlyar, V., Pingali, K., and Stodghill, P.: Solving Alignment Using Elementary Linear Algebra, Proceedings of the 7th Workshop on Languages and Compilers for Parallel Computing, Ithica, NY, 1994, Springer-Verlag, 1995.
Chen, T.-S. and Sheu, J.-P.: Communication-Free Data Allocation Techniques for Parallelizing Compilers on Multicomputers, IEEE Transactions on Parallel and Distributed Systems, Vol. 5,No. 9 (1994), pp. 924–938.
Guo, M., Yamashita, Y., and Nakata, I.: An Efficient Data Distribution Technique for Distributed Memory Parallel Computers, Transactions of Information Processing Society of Japan, Vol. 39,No. 6, pp. 1718–1728 (1998).
Guo, M., Yamashita, Y., and Nakata, I.: Efficient Implementation of Multi-Dimensional Array Redistribution, lEICE Transactions on Information and Systems, Vol. E81-D, No. 11, pp. 1195–1204 (1998).
Guo, M., Yamashita, Y., and Nakata, I.: Improving Performance of Multi-dimensional Array Redistribution on Distributed Memory Machines, Proceedings of the Third International Workshop on High-Level Parallel Programming Models and Supportive Environments, Orlando, FL, USA, March 1998.
Gupta, M.: Automatic Data Partitioning on Distributed Memory Multicomputers, PhD Thesis, University of Illinois, Urbana-Champaign, Sep. 1992.
Gupta, M. and Banerjee, P.: Compile-time Estimation of Communication Costs on Multicomputers, Proceedings of the Sixth International Parallel Processing Symposium, Beverly Hills, CA, March 1992.
HPF Forum: High Performance Fortran Language Specification,Rice University, Houston, Texas, version 2.0 edition, Nov. 1996.
Kennedy, K. and Kremer, U.: Automatic Data Layout for High Performance Fortran, Proceedings of Supercomputing’95, San Diego, CA, Dec. 1995.
Ramanujam, J. and Sadayappan, P.: Compile-Time Techniques for Data Distribution in Distributed Memory Machines, IEEE Transactions on Parallel and Distributed Systems, Vol. 2,No. 4 (1991), pp. 472–481.
Wolfe, M.: High Performance Compilers for Parallel Computing, Addison-Wesley, 1995.
Zima, H. and Chapman, B.: Supercompilers for Parallel and Vector Computers, Frontier Series, Addison-Wesley, 1990.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer Science+Business Media New York
About this chapter
Cite this chapter
Guo, M. (2004). Linear Data Distribution based on Index Analysis. In: Yang, L.T., Pan, Y. (eds) High Performance Scientific and Engineering Computing. The Springer International Series in Engineering and Computer Science, vol 750. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-5402-5_2
Download citation
DOI: https://doi.org/10.1007/978-1-4757-5402-5_2
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5389-6
Online ISBN: 978-1-4757-5402-5
eBook Packages: Springer Book Archive