Skip to content
Licensed Unlicensed Requires Authentication Published by De Gruyter Oldenbourg June 1, 2023

IoT-enriched event log generation and quality analytics: a case study

  • Joscha Grüger

    Joscha Grüger, M.Sc. is a research associate and PhD candidate at the Chair of Artificial Intelligence and Intelligent Information Systems at the University of Trier and researches at the German Research Center for Artificial Intelligence (DFKI) Trier Branch. In 2019, he received his master’s degree in computer science from Trier University of Applied Science. His research focuses on the use of artificial intelligence methods, process mining, and business process management technologies, especially in the healthcare domain and in the context of the Internet of Things.

    EMAIL logo
    , Lukas Malburg

    Lukas Malburg, M.Sc. is a research assistant and PhD student at the Department of Artificial Intelligence and Intelligent Information Systems at Trier University. In addition, he is part of the German Research Center for Artificial Intelligence (DFKI) branch Trier since May 2021. In 2019, he obtained his Master degree in business informatics from the University of Trier. In his research, he examines the use of Artificial Intelligence methods, in particular (Process-Oriented) Case-Based Reasoning and AI Planning, in Cyber-Physical Production Systems and the Internet of Things combined with Business Process Management technologies.

    and Ralph Bergmann

    Prof. Dr. Ralph Bergmann is full professor at Trier University since 2004 and is directing a research group on business informatics with a strong focus on Artificial Intelligence. Since 2020 he is also topic-field leader for experience-based learning systems at the Trier Branch of the German Research Center for AI (DFKI). Over the past 30 years, he has significantly contributed to the foundations and applications of AI, including knowledge-based systems, knowledge representation and reasoning, case-based reasoning, machine learning, AI planning, and semantic technologies. With the current focus on experience-based learning systems he aims at developing hybrid AI-systems integrating data-oriented AI methods (machine learning and case-based reasoning) with semantic technologies for modeling explicit knowledge. He authored more than 250 refereed papers, including four books and 13 edited proceedings volumes and led more than 35 research projects.

Abstract

Modern technologies such as the Internet of Things (IoT) are becoming increasingly important in various fields, including business process management (BPM) research. An important area of research in BPM is process mining, which can be used to analyze event logs e.g., to check the conformance of running processes. However, the data ingested in IoT environments often contain data quality issues (DQIs) due to system complexity and sensor heterogeneity, among other factors. To date, however, there has been little work on IoT event logs, DQIs occurring in them, and how to handle them. In this case study, we generate an IoT event log, perform a structured data quality analysis, and describe how we addressed the problems we encountered in pre-processing.


Corresponding author: Joscha Grüger, Artificial Intelligence and Intelligent Information Systems, University of Trier, 54296 Trier, Germany; and German Research Center for Artificial Intelligence (DFKI), Branch University of Trier, 54296 Trier, Germany, E-mail:

Joscha Grüger and Lukas Malburg contributed equally to the work.


About the authors

Joscha Grüger

Joscha Grüger, M.Sc. is a research associate and PhD candidate at the Chair of Artificial Intelligence and Intelligent Information Systems at the University of Trier and researches at the German Research Center for Artificial Intelligence (DFKI) Trier Branch. In 2019, he received his master’s degree in computer science from Trier University of Applied Science. His research focuses on the use of artificial intelligence methods, process mining, and business process management technologies, especially in the healthcare domain and in the context of the Internet of Things.

Lukas Malburg

Lukas Malburg, M.Sc. is a research assistant and PhD student at the Department of Artificial Intelligence and Intelligent Information Systems at Trier University. In addition, he is part of the German Research Center for Artificial Intelligence (DFKI) branch Trier since May 2021. In 2019, he obtained his Master degree in business informatics from the University of Trier. In his research, he examines the use of Artificial Intelligence methods, in particular (Process-Oriented) Case-Based Reasoning and AI Planning, in Cyber-Physical Production Systems and the Internet of Things combined with Business Process Management technologies.

Ralph Bergmann

Prof. Dr. Ralph Bergmann is full professor at Trier University since 2004 and is directing a research group on business informatics with a strong focus on Artificial Intelligence. Since 2020 he is also topic-field leader for experience-based learning systems at the Trier Branch of the German Research Center for AI (DFKI). Over the past 30 years, he has significantly contributed to the foundations and applications of AI, including knowledge-based systems, knowledge representation and reasoning, case-based reasoning, machine learning, AI planning, and semantic technologies. With the current focus on experience-based learning systems he aims at developing hybrid AI-systems integrating data-oriented AI methods (machine learning and case-based reasoning) with semantic technologies for modeling explicit knowledge. He authored more than 250 refereed papers, including four books and 13 edited proceedings volumes and led more than 35 research projects.

  1. Author contributions: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.

  2. Research funding: This work is funded by the Federal Ministry for Economic Affairs and Climate Action under grant No. 01MD22002C EASY.

  3. Conflict of interest statement: The authors declare no conflicts of interest regarding this article.

References

[1] C. Janiesch, A. Koschmider, M. Mecella, et al.., “The internet of things meets business process management: a manifesto,” IEEE Syst. Man Cybern. Mag., vol. 6, no. 4, pp. 34–44, 2020. https://doi.org/10.1109/msmc.2020.3003135.Search in Google Scholar

[2] L. Malburg, P. Klein, and R. Bergmann, “Semantic web services for AI-research with physical factory simulation models in industry 4.0,” in 1st IN4PL, SciTePress, 2020, pp. 32–43.10.5220/0010135900320043Search in Google Scholar

[3] R. Seiger, L. Malburg, B. Weber, and R. Bergmann, “Integrating process management and event processing in smart factories: a systems architecture and use cases,” J. Manuf. Syst., vol. 63, pp. 575–592, 2022. https://doi.org/10.1016/j.jmsy.2022.05.012.Search in Google Scholar

[4] A. Gaddam, T. Wilkin, M. Angelova, and J. Gaddam, “Detecting sensor faults, anomalies and outliers in the internet of things: a survey on the challenges and solutions,” Electronics, vol. 9, no. 3, p. 511, 2020. https://doi.org/10.3390/electronics9030511.Search in Google Scholar

[5] W. M. P. van der Aalst, A. Adriansyah, and A. K. A. de Medeiros, et al.., “Process mining manifesto,” in BPM Workshops, Springer, 2012, pp. 169–194.Search in Google Scholar

[6] J. C. J. C. Bose, R. S. Mans, and W. M. P. van der Aalst, “Wanna improve process mining results?” in CIDM, IEEE, 2013, pp. 127–134.Search in Google Scholar

[7] R. Verhulst, “Evaluating quality of event data within event logs: an extensible framework,” Master thesis, 2016.Search in Google Scholar

[8] L. Malburg, J. Grüger, and R. Bergmann, “Dataset: an Iot-enriched event log for process mining in smart factories,” 2022.Search in Google Scholar

[9] J. Mangler, J. Grüger, L. Malburg, et al.., “DataStream XES extension: embedding IoT sensor data into extensible event stream logs,” Future Internet, vol. 15, no. 3, p. 109, 2023. https://doi.org/10.3390/fi15030109.Search in Google Scholar

[10] L. Malburg, M. Hoffmann, and R. Bergmann, “Applying MAPE-K control loops for adaptive workflow management in smart factories,” J. Intell. Inf. Syst., pp. 1–29, 2023. https://doi.org/10.1007/s10844-022-00766-w.Search in Google Scholar

[11] L. Malburg, R. Seiger, R. Bergmann, and B. Weber, “Using physical factory simulation models for business process management research,” in BPM Workshops, LNBIP, vol. 397, Springer, 2020, pp. 95–107.10.1007/978-3-030-66498-5_8Search in Google Scholar

[12] M. Compton, P. Barnaghi, L. Bermudez, et al.., “The SSN ontology of the W3C semantic sensor network incubator group,” J. Web Semant., vol. 17, pp. 25–32, 2012. https://doi.org/10.1016/j.websem.2012.05.003.Search in Google Scholar

[13] L. Mazzola, P. Kapahnke, M. Vujic, and M. Klusch, “CDM-core: a manufacturing domain ontology in OWL2 for production and maintenance,” in 8th KEOD, 2016, pp. 136–143.10.5220/0006056301360143Search in Google Scholar

[14] M. Bermudez-Edo, T. Elsaleh, P. Barnaghi, and K. Taylor, “IoT-lite: a lightweight semantic model for the internet of things,” in UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld 2016, IEEE, 2016, pp. 90–97.10.1109/UIC-ATC-ScalCom-CBDCom-IoP-SmartWorld.2016.0035Search in Google Scholar

[15] K. Janowicz, A. Haller, S. J. Cox, D. Le Phuoc, and M. Lefrançois, “SOSA: a lightweight ontology for sensors, observations, samples, and actuators,” J. Web Semant., vol. 56, pp. 1–10, 2019. https://doi.org/10.1016/j.websem.2018.06.003.Search in Google Scholar

[16] T. Elsaleh, M. Bermudez-Edo, S. Enshaeifar, S. T. Acton, R. Rezvani, and P. Barnaghi, “IoT-stream: a lightweight ontology for internet of things data streams,” in Global IoT Summit Proc, IEEE, 2019, pp. 1–6.10.1109/GIOTS.2019.8766367Search in Google Scholar

[17] S. Lemaignan, A. Siadat, J. Y. Dantan, and A. Semenenko, “MASON: a proposal for an ontology of manufacturing domain,” in Workshop on Distrib. Intell. Syst.: Collect. Intell. and its Appl., IEEE, 2006, pp. 195–200.Search in Google Scholar

[18] P. Klein, L. Malburg, and R. Bergmann, “FTOnto: a domain ontology for a Fischertechnik simulation production factory by reusing existing ontologies,” in 21st LWDA, vol. 2454, CEUR-WS.org, 2019, pp. 253–264.Search in Google Scholar

[19] C. W. Günther and E. Verbeek, XES Standard Definition – Version 2.0, 2014.Search in Google Scholar

[20] H. M. Marin-Castro and E. Tello-Leal, “Event log preprocessing for process mining: a review,” Appl. Sci., vol. 11, no. 22, p. 10556, 2021. https://doi.org/10.3390/app112210556.Search in Google Scholar

[21] M. T. Wynn, J. Lebherz, W. M. P. van der Aalst, et al.., “Rethinking the input for process mining: insights from the XES survey and workshop,” in 3rd ICPM Workshops, LNBIP, vol. 433, Springer, 2021, pp. 3–16.10.1007/978-3-030-98581-3_1Search in Google Scholar

[22] B. F. van Dongen and W. M. P. van der Aalst, “A meta model for process mining data,” in EMOI-INTEROP, vol. 160, 2005, p. 30.Search in Google Scholar

[23] B. F. van Dongen and S. Shabani, “Relational XES: data management for process mining,” in CAiSE Forum, 2015.Search in Google Scholar

[24] A. F. Ghahfarokhi, G. Park, A. Berti, and W. M. P. van der Aalst, “OCEL: a standard for object-centric event logs,” in CCIS, vol. 1450, Springer, 2021, pp. 169–175.10.1007/978-3-030-85082-1_16Search in Google Scholar

[25] V. Popova, D. Fahland, and M. Dumas, “Artifact lifecycle discovery,” CoRR abs/1303.2554, 2013.Search in Google Scholar

[26] Y. Bertrand, J. De Weerdt, and E. Serral, “A bridging model for process mining and iot,” in Process Mining Workshops, J. Munoz-Gama and X. Lu, Eds., Springer, 2022, pp. 98–110.10.1007/978-3-030-98581-3_8Search in Google Scholar

[27] J. Wei, C. Ouyang, A. H. ter Hofstede, and C. Moreira, “AMORETTO: a method for deriving IoT-enriched event logs,” CoRR abs/2212.02071, 2022.Search in Google Scholar

[28] S. A. Alasadi and W. Bhaya, “Review of data preprocessing techniques in data mining,” J. Eng. Appl. Sci., vol. 12, pp. 4102–4107, 2017.Search in Google Scholar

[29] V. Agarwal, “Research on data preprocessing and categorization technique for smartphone review analysis,” Int. J. Comput. Appl., vol. 131, no. 4, pp. 30–36, 2015. https://doi.org/10.5120/ijca2015907309.Search in Google Scholar

[30] U. M. Fayyad and G. Piatetsky-Shapiro, “Padhraic smyth: from data mining to knowledge Discovery in databases,” AI Mag., vol. 17, pp. 37–54, 1996.Search in Google Scholar

[31] S. Suriadi, R. Andrews, A. ter Hofstede, and M. T. Wynn, “Event log imperfection patterns for process mining: towards a systematic approach to cleaning event logs,” Inf. Syst., vol. 64, pp. 132–150, 2017. https://doi.org/10.1016/j.is.2016.07.011.Search in Google Scholar

[32] W. M. P. van der Aalst, “Extracting event data from databases to unleash process mining,” in Management for Professionals, Cham, Springer International Publishing, 2015, pp. 105–128.10.1007/978-3-319-14430-6_8Search in Google Scholar

[33] R. Seiger, F. Zerbato, A. Burattin, L. Garcia-Banuelos, and B. Weber, “Towards IoT-driven process event log generation for conformance checking in smart factories,” in 24th EDOC Workshops, IEEE, 2020, pp. 20–26.10.1109/EDOCW49879.2020.00016Search in Google Scholar

Received: 2022-12-23
Accepted: 2023-05-16
Published Online: 2023-06-01
Published in Print: 2023-06-28

© 2023 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 30.4.2024 from https://www.degruyter.com/document/doi/10.1515/itit-2022-0077/html
Scroll to top button