
Prediction of laparoscopic procedure duration using unlabeled, multimodal sensor data

  • Original Article
  • Published in: International Journal of Computer Assisted Radiology and Surgery

Abstract

Purpose

The course of surgical procedures is often unpredictable, which makes it difficult to estimate their duration beforehand and, in turn, makes scheduling surgical procedures a difficult task. A context-aware method that analyses the workflow of an intervention online and automatically predicts the remaining duration would alleviate these problems. As a basis for such an estimate, information regarding the current state of the intervention is required.

Methods

Today, the operating room contains a diverse range of sensors. During laparoscopic interventions, the endoscopic video stream is an ideal source of such information. Extracting quantitative information from the video is challenging, however, due to its high dimensionality. Other surgical devices (e.g., insufflator, lights) provide data streams that are, in contrast to the video stream, more compact and easier to quantify, though whether such streams alone offer sufficient information for estimating the duration of surgery is uncertain. In this paper, we propose and compare methods, based on convolutional neural networks, for continuously predicting the duration of laparoscopic interventions from unlabeled data, such as endoscopic images and surgical device streams.
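The kind of online estimator described above can be illustrated with a minimal sketch: per-frame visual features and surgical-device signals are fused into a recurrent state, which is mapped to an estimated fraction of the procedure completed; the remaining duration then follows by a rule of three from the elapsed time. All names, dimensions, and the fusion rule below are illustrative assumptions, not the authors' architecture.

```python
import numpy as np

def predict_remaining(visual_feats, device_feats, elapsed_s, w, state, alpha=0.9):
    """One online step: fuse both modalities, update a recurrent state,
    and map it to a progress estimate in (0, 1)."""
    x = np.concatenate([visual_feats, device_feats])       # multimodal fusion
    state = alpha * state + (1 - alpha) * np.tanh(w @ x)   # leaky recurrent update
    progress = 1.0 / (1.0 + np.exp(-state.mean()))         # sigmoid -> fraction done
    progress = max(progress, 1e-3)                         # guard against division by zero
    remaining_s = elapsed_s * (1.0 - progress) / progress  # rule of three
    return remaining_s, state
```

In a real system, `visual_feats` would come from a convolutional network applied to the endoscopic frame and `device_feats` from the insufflator, light source, and similar devices; the toy linear-plus-tanh state here stands in for a trained recurrent network.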

Results

The methods are evaluated on 80 recorded laparoscopic interventions of various types, for which surgical device data and the endoscopic video streams are available. The combined method performs best, with an overall average error of 37% and an average halftime error of approximately 28%.
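The error figures above are relative to the procedure duration. One plausible way to compute such metrics (the paper's exact definition is not reproduced here, so this is an assumption) is to average the absolute error of the remaining-duration predictions over all time steps and normalize by the true total duration, with the "halftime" variant restricting the average to the second half of the procedure:

```python
def duration_errors(predicted_remaining, total_s):
    """Relative errors of per-second remaining-duration predictions
    against the known total duration of the procedure."""
    n = len(predicted_remaining)
    # absolute error at each elapsed second t against true remaining time
    abs_err = [abs(p - (total_s - t)) for t, p in enumerate(predicted_remaining)]
    overall = sum(abs_err) / n / total_s              # mean error as fraction of total
    second_half = abs_err[n // 2:]                    # "halftime" error: second half only
    halftime = sum(second_half) / len(second_half) / total_s
    return overall, halftime
```

A perfect predictor yields (0.0, 0.0); a predictor that is consistently off by 10% of the total duration yields roughly (0.1, 0.1).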

Conclusion

In this paper, we present, to our knowledge, the first approach for online procedure duration prediction using unlabeled endoscopic video data and surgical device data in a laparoscopic setting. Furthermore, we show that a method incorporating both vision and device data performs better than methods based only on vision, while methods only based on tool usage and surgical device data perform poorly, showing the importance of the visual channel.


Fig. 1



Author information


Corresponding author

Correspondence to Sebastian Bodenstedt.

Ethics declarations

Conflict of interest

S. Bodenstedt, M. Wagner, L. Mündermann, H. Kenngott, B. Müller-Stich, M. Breucha, S. Mees, J. Weitz and S. Speidel declare that they have no conflict of interest.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Declaration of Helsinki and its later amendments or comparable ethical standards.

Informed consent

Informed consent was obtained from the study participants.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.


About this article


Cite this article

Bodenstedt, S., Wagner, M., Mündermann, L. et al. Prediction of laparoscopic procedure duration using unlabeled, multimodal sensor data. Int J CARS 14, 1089–1095 (2019). https://doi.org/10.1007/s11548-019-01966-6

