Abstract
Sparse matrix multiplication (SPMM) is widely used in machine learning algorithms. As applications of SPMM on large-scale datasets become prevalent, executing SPMM jobs on an optimized setup has become very important. Execution environments for distributed SPMM tasks on cloud resources can be set up in diverse ways with respect to the input sparse datasets, the distinct SPMM implementation methods, and the choice of cloud instance types. In this paper, we propose S-MPEC, which predicts the latency of various SPMM tasks executed with Apache Spark in distributed cloud environments. We first characterize various distributed SPMM implementations on Apache Spark. Considering these characteristics and the hardware specifications of cloud instances, we propose unique features to build a GB-regressor model, tuned with Bayesian optimization. The proposed S-MPEC model accurately predicts the latency of an arbitrary SPMM task and recommends an optimal implementation method. A thorough evaluation of the proposed system reveals that a user can expect 44% less latency when completing SPMM tasks compared with the native SPMM implementations in Apache Spark.
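To make the modeling idea concrete, the following is a minimal sketch of a GB-regressor latency predictor, using scikit-learn's `GradientBoostingRegressor` as a stand-in for the paper's model. The feature set (matrix dimensions, non-zero density, an encoded instance type) and the training data are illustrative assumptions, not the paper's actual profiling data or feature engineering.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

rng = np.random.default_rng(0)

# Synthetic profiling data: each row is one SPMM task described by
# hypothetical features [rows, cols, nnz_density, instance_type_code].
X = rng.random((200, 4))
# Synthetic latency labels (seconds) with a linear trend plus noise.
y = X @ np.array([2.0, 1.5, 3.0, 0.5]) + rng.normal(0.0, 0.1, 200)

# Fit a gradient-boosted regression-tree model on the profiled tasks.
model = GradientBoostingRegressor(n_estimators=100, max_depth=3)
model.fit(X, y)

# For a new SPMM task, score candidate (implementation, instance) setups
# and recommend the one with the lowest predicted latency.
candidates = rng.random((5, 4))
pred = model.predict(candidates)
best = candidates[np.argmin(pred)]
```

In the paper, the model's hyperparameters are additionally tuned with Bayesian optimization; the sketch above omits that step and uses fixed hyperparameters for brevity.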
Acknowledgements
This work is supported by the National Research Foundation of Korea (NRF) Grant funded by the Korean Government (MSIP) (NRF-2020R1A2C1102544, NRF-2016R1C1B2015135, and NRF-2015R1A5A7037615), the ICT R&D program of IITP (2017-0-00396), and Research Credits provided by AWS.
Cite this article
Park, J., Lee, K. S-MPEC: Sparse Matrix Multiplication Performance Estimator on a Cloud Environment. Cluster Comput 26, 2563–2576 (2023). https://doi.org/10.1007/s10586-021-03287-3