Skip to main content

Supporting Event Log Extraction Based on Matching

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Business Information Processing ((LNBIP,volume 460))

Abstract

Process mining allows organizations to obtain relevant insights into the execution of their processes. However, the starting point of any process mining analysis is an event log, which is typically not readily available in practice. The extraction of event logs from the relevant databases is a manual and highly time-consuming task, and often a hurdle for the application of process mining altogether. Available support for event log extraction comes with different assumptions and requirements and only provides limited automated support. In this paper, we therefore take a novel angle at supporting event log extraction. The core idea of our paper is to use an existing process model as a starting point and automatically identify to which database tables the activities of the considered process model relate to. Based on the resulting mapping, an event log can then be extracted in an automated fashion. We use this paper to define a first approach that is able to identify such a mapping between a process model and a database. We evaluate our approach using three real-world databases and five process models from the purchase-to-pay domain. The results of our evaluation show that our approach has the potential to successfully support event log extraction based on matching.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. van der Aalst, W.M.P.: Process Mining: Data Science in Action. Springer, Heidelberg (2016)

    Book  Google Scholar 

  2. Calvanese, D., Kalayci, T.E., Montali, M., Santoso, A.: OBDA for log extraction in process mining. In: Ianni, G., et al. (eds.) Reasoning Web 2017. LNCS, vol. 10370, pp. 292–345. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-61033-7_9

    Chapter  Google Scholar 

  3. Calvanese, D., Kalayci, T.E., Montali, M., Tinella, S.: Ontology-based data access for extracting event logs from legacy data: the onprom tool and methodology. In: Abramowicz, W. (ed.) BIS 2017. LNBIP, vol. 288, pp. 220–236. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-59336-4_16

    Chapter  Google Scholar 

  4. Diba, K., Batoulis, K., Weidlich, M., Weske, M.: Extraction, correlation, and abstraction of event data for process mining. WIREs Data Min. Knowl. Discov. 10(3) (2020)

    Google Scholar 

  5. Dumas, M., La Rosa, M., Mendling, J., Reijers, H.A.: Fundamentals of Business Process Management. Springer, Heidelberg (2018)

    Book  Google Scholar 

  6. Figl, K., Mendling, J., Strembeck, M.: The influence of notational deficiencies on process model comprehension. J. Assoc. Inf. Syst. 14, 312–338 (2013)

    Google Scholar 

  7. Gal, A.: Uncertain Schema Matching, vol. 3. Morgan & Claypool (2011)

    Google Scholar 

  8. Jagroep, E., Van der Werf, J.M., Broekman, J., Blom, L., van Vliet, R., Brinkkemper, S.: A resource utilization score for software energy consumption. In: Proceedings of ICT for Sustainability 2016 (2016)

    Google Scholar 

  9. Jans, M., Alles, M., Vasarhelyi, M.: The case for process mining in auditing: sources of value added and areas of application. Int. J. Account. Inf. Syst. 14, 1–20 (2013)

    Article  Google Scholar 

  10. Jean-Mary, Y.R., Shironoshita, E.P., Kabuka, M.R.: Ontology matching with semantic verification. Web Semant. 7(3), 235–251 (2009)

    Article  Google Scholar 

  11. Lambrix, P., Tan, H.: Sambo - a system for aligning and merging biomedical ontologies. J. Web Semant. 4(3), 196–206 (2006)

    Article  Google Scholar 

  12. Leopold, H., Niepert, M., Weidlich, M., Mendling, J., Dijkman, R., Stuckenschmidt, H.: Probabilistic optimization of semantic process model matching. In: Barros, A., Gal, A., Kindler, E. (eds.) BPM 2012. LNCS, vol. 7481, pp. 319–334. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-32885-5_25

    Chapter  Google Scholar 

  13. Li, G., de Murillas, E.G.L., de Carvalho, R.M., van der Aalst, W.M.P.: Extracting object-centric event logs to support process mining on databases. In: Mendling, J., Mouratidis, H. (eds.) CAiSE 2018. LNBIP, vol. 317, pp. 182–199. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-92901-9_16

    Chapter  Google Scholar 

  14. Madhavan, J., Bernstein, P., Rahm, E.: Generic schema matching with cupid. In: Proceedings of the 27th VLDB Conference (2001)

    Google Scholar 

  15. Meilicke, C., Leopold, H., Kuss, E., Stuckenschmidt, H., Reijers, H.A.: Overcoming individual process model matcher weaknesses using ensemble matching. Decis. Support Syst. 100, 15–26 (2017)

    Article  Google Scholar 

  16. Murillas, E., Reijers, H., Aalst, W.: Connecting databases with process mining: a meta model and toolset. Softw. Syst. Model. 231–249 (2016)

    Google Scholar 

  17. Nikovski, D., Esenther, A., Ye, X., Shiba, M., Takayama, S.: Matcher composition methods for automatic schema matching. In: Cordeiro, J., Maciaszek, L.A., Filipe, J. (eds.) ICEIS 2012. LNBIP, vol. 141, pp. 108–123. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-40654-6_7

    Chapter  Google Scholar 

  18. Post, R., et al.: Active anomaly detection for key item selection in process auditing. In: Munoz-Gama, J., Lu, X. (eds.) ICPM 2021. LNBIP, vol. 433, pp. 167–179. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-98581-3_13

    Chapter  Google Scholar 

  19. Saldaña, J.: The Coding Manual for Qualitative Researchers. Sage (2009)

    Google Scholar 

  20. Stein Dani, V., et al.: Towards understanding the role of the human in event log extraction. In: Marrella, A., Weber, B. (eds.) BPM 2021. LNBIP, vol. 436, pp. 86–98. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-94343-1_7

    Chapter  Google Scholar 

  21. van der Aa, H., Leopold, H., Reijers, H.A.: Comparing textual descriptions to process models - the automatic detection of inconsistencies. Inf. Syst. 64, 447–460 (2017)

    Article  Google Scholar 

  22. Aalst, W.M.P.: Extracting event data from databases to unleash process mining. In: vom Brocke, J., Schmiedel, T. (eds.) BPM - Driving Innovation in a Digital World. MP, pp. 105–128. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-14430-6_8

    Chapter  Google Scholar 

  23. Weidlich, M., Dijkman, R., Mendling, J.: The ICoP framework: identification of correspondences between process models. In: Pernici, B. (ed.) CAiSE 2010. LNCS, vol. 6051, pp. 483–498. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-13094-6_37

    Chapter  Google Scholar 

  24. Weske, M., Decker, G., Dumas, M., La Rosa, M., Mendling, J., Reijers, H.A.: Model collection of the bpm academic initiative (2020)

    Google Scholar 

Download references

Acknowledgements

Part of this research was funded by NWO (Netherlands Organisation for Scientific Research) project number 16672.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vinicius Stein Dani .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Stein Dani, V., Leopold, H., van der Werf, J.M.E.M., Reijers, H.A. (2023). Supporting Event Log Extraction Based on Matching. In: Cabanillas, C., Garmann-Johnsen, N.F., Koschmider, A. (eds) Business Process Management Workshops. BPM 2022. Lecture Notes in Business Information Processing, vol 460. Springer, Cham. https://doi.org/10.1007/978-3-031-25383-6_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-25383-6_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-25382-9

  • Online ISBN: 978-3-031-25383-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics