Skip to main content

PM4Py-GPU: A High-Performance General-Purpose Library for Process Mining

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 446))

Abstract

Open-source process mining provides many algorithms for the analysis of event data which could be used to analyze mainstream processes (e.g., O2C, P2P, CRM). However, compared to commercial tools, they lack the performance and struggle to analyze large amounts of data. This paper presents PM4Py-GPU, a Python process mining library based on the NVIDIA RAPIDS framework. Thanks to the dataframe columnar storage and the high level of parallelism, a significant speed-up is achieved on classic process mining computations and processing activities.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    https://www.pads.rwth-aachen.de/go/id/ezupn/lidx/1.

  2. 2.

    https://www.pads.rwth-aachen.de/go/id/khbht.

References

  1. Cano, A.: A survey on graphic processing unit computing for large-scale data mining. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 8(1) (2018). https://doi.org/10.1002/widm.1232

  2. Ferreira, D.R., Santos, R.M.: Parallelization of transition counting for process mining on multi-core CPUs and GPUs. In: Dumas, M., Fantinato, M. (eds.) BPM 2016. LNBIP, vol. 281, pp. 36–48. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58457-7_3

  3. Hernández, S., van Zelst, S.J., Ezpeleta, J., van der Aalst, W.M.P.: Handling big(ger) logs: Connecting prom 6 to apache hadoop. In: Daniel, F., Zugal, S. (eds.) Proceedings of the BPM Demo Session 2015 Co-located with the 13th International Conference on Business Process Management (BPM 2015), Innsbruck, Austria, 2 September 2015. CEUR Workshop Proceedings, vol. 1418, pp. 80–84. CEUR-WS.org (2015). http://ceur-ws.org/Vol-1418/paper17.pdf

  4. Kundra, D., Juneja, P., Sureka, A.: Vidushi: parallel implementation of alpha miner algorithm and performance analysis on CPU and GPU architecture. In: Reichert, M., Reijers, H.A. (eds.) BPM 2015. LNBIP, vol. 256, pp. 230–241. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-42887-1_19

  5. Nogueira, A.F., Rela, M.Z.: Monitoring a CI/CD workflow using process mining. SN Comput. Sci. 2(6), 448 (2021). https://doi.org/10.1007/s42979-021-00830-2

    Article  Google Scholar 

  6. Pasquadibisceglie, V., Appice, A., Castellano, G., Malerba, D.: Using convolutional neural networks for predictive process analytics. In: International Conference on Process Mining, ICPM 2019, Aachen, Germany, 24–26 June 2019, pp. 129–136. IEEE (2019). https://doi.org/10.1109/ICPM.2019.00028

  7. Tax, N., Verenich, I., La Rosa, M., Dumas, M.: Predictive business process monitoring with LSTM neural networks. In: Dubois, E., Pohl, K. (eds.) CAiSE 2017. LNCS, vol. 10253, pp. 477–492. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59536-8_30

Download references

Acknowledgement

We thank the Alexander von Humboldt (AvH) Stiftung for supporting our research.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alessandro Berti .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Berti, A., Nghia, M.P., van der Aalst, W.M.P. (2022). PM4Py-GPU: A High-Performance General-Purpose Library for Process Mining. In: Guizzardi, R., Ralyté, J., Franch, X. (eds) Research Challenges in Information Science. RCIS 2022. Lecture Notes in Business Information Processing, vol 446. Springer, Cham. https://doi.org/10.1007/978-3-031-05760-1_49

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-05760-1_49

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-05759-5

  • Online ISBN: 978-3-031-05760-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics