Parallel Algorithm for Quasi-Band Matrix-Matrix Multiplication

Vooturi, Dharma Teja; Kothapalli, Kishore

doi:10.1007/978-3-319-32149-3_11

Parallel Algorithm for Quasi-Band Matrix-Matrix Multiplication

Dharma Teja Vooturi⁷ &
Kishore Kothapalli⁷

Conference paper
First Online: 02 April 2016

1272 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 9573))

Abstract

Sparse matrices arise in many practical scenarios. As a result, support for efficient operations such as multiplication of sparse matrices (spmm) is considered to be an important research area. Often, sparse matrices also exhibit particular characteristics that can be used towards better parallel algorithmics. In this paper, we focus on quasi-band sparse matrices that have a large majority of the non-zeros along the diagonals. We design and implement an efficient algorithm for multiplying two such matrices on a many-core architecture such as a GPU.

Our implementation outperforms the corresponding library implementation by a factor of 2x on average over a wide variety of quasi-band matrices from standard datasets. We analyze our performance over synthetic quasi-band matrices.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Bell, N., Garland, M.: Implementing sparse matrix-vector multiplication on throughput-oriented processors. In: Proceeding SuperComputing (SC), pp. 1–11 (2009)
Google Scholar
Buluc, A., Gilbert, J.R.: Challenges and advances in parallel sparse matrix-matrix multiplication. In: Proceeding International Conference on Parallel Processing, pp. 503–510 (2008)
Google Scholar
Gharaibeh, A., Costa, B., Santos-Neto, E., Ripeanu, M.: On Graphs, GPUs, and Blind Dating: a workload to processor matchmaking quest. In: Proceeding International Parallel & Distributed Processing Symposium (IPDPS), pp. 851–862 (2013)
Google Scholar
Hong, S., Rodia, N.C., Olukotun, K.: On fast parallel detection of strongly connected components in small-world graphs. In: Proceedings of the SC (2013). Article No. 92
Google Scholar
Indarapu, S., Maramreddy, M., Kothapalli, K.: Architecture- and workload-aware algorithms for spare matrix- vector multiplication. In: Proceeding of ACM India Computing Conference (2014). Article No. 3
Google Scholar
Liu, W., Vinter, B.: An efficient GPU general sparse matrix-matrix multiplication for irregular data. In: Proceeding of IPDPS, pp. 370–381 (2014)
Google Scholar
Ramamoorthy, K.R., Banerjee, D.S., Srinathan, K., Kothapalli, K.: A novel heterogeneous algorithm for multiplying scale-free sparse matrices. In: Proceeding of IPDPS Workshops, pp. 637–646 (2015)
Google Scholar
Nvidia sparse matrix library (cuSPARSE). http://developer.nvidia.com/cusparse
Intel Math Kernel Library. https://software.intel.com/en-us/articles/intel-mkl/
University of Florida UF sparse matrix collection (2011). http://www.cise.ufl.edu/research/sparse/matrices/groups.html
Yang, W., Li, K., Liu, Y., Shi, L., Wan, L.: Optimization of quasi-diagonal matrix-vector multiplication on GPU. Int. J. High Perform. Comput. Appl. 28(2), 183–195 (2014)
Article Google Scholar

Download references

Author information

Authors and Affiliations

International Institute of Information Technology, Hyderabad, Gachibowli, Hyderabad, 500032, India
Dharma Teja Vooturi & Kishore Kothapalli

Authors

Dharma Teja Vooturi
View author publications
You can also search for this author in PubMed Google Scholar
Kishore Kothapalli
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dharma Teja Vooturi .

Editor information

Editors and Affiliations

Czestochowa University of Technolog, Czestochowa, Poland
Roman Wyrzykowski
Department of Computer Science, University of Southern California, Marina Del Rey, California, USA
Ewa Deelman
Electrical Engineering & Comput. Science, University of Tennessee, Knoxville, Tennessee, USA
Jack Dongarra
Czestochowa University of Technology, Institute of Computer & Information Sci., Czestochowa, Poland
Konrad Karczewski
Department of Computer Science, AGH University of Science and Technology, Krakow, Poland
Jacek Kitowski
Systèmes d’informations, Big Data et Rec, AGH University of Science and Technology, Krakow, Poland
Kazimierz Wiatr

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vooturi, D.T., Kothapalli, K. (2016). Parallel Algorithm for Quasi-Band Matrix-Matrix Multiplication. In: Wyrzykowski, R., Deelman, E., Dongarra, J., Karczewski, K., Kitowski, J., Wiatr, K. (eds) Parallel Processing and Applied Mathematics. PPAM 2015. Lecture Notes in Computer Science(), vol 9573. Springer, Cham. https://doi.org/10.1007/978-3-319-32149-3_11

Download citation

DOI: https://doi.org/10.1007/978-3-319-32149-3_11
Published: 02 April 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32148-6
Online ISBN: 978-3-319-32149-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics