Anatomy segmentation in laparoscopic surgery: comparison of machine learning and human expertise - an experimental study

Fiona R Kolbinger; Franziska M Rinner; Alexander C Jenke; Matthias Carstens; Stefanie Krell; Stefan Leger; Marius Distler; Jürgen Weitz; Stefanie Speidel; Sebastian Bodenstedt

doi:10.1097/JS9.0000000000000595

Int J Surg. 2023 Oct 1;109(10):2962-2974. doi: 10.1097/JS9.0000000000000595.

Authors

Fiona R Kolbinger¹ ² ³, Franziska M Rinner¹, Alexander C Jenke⁴, Matthias Carstens¹, Stefanie Krell⁴, Stefan Leger³ ⁴, Marius Distler¹ ², Jürgen Weitz¹ ² ³ ⁵, Stefanie Speidel³ ⁴ ⁵, Sebastian Bodenstedt⁴ ⁵

Affiliations

¹ Department of Visceral, Thoracic and Vascular Surgery, University Hospital and Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden.
² National Center for Tumor Diseases (NCT/UCC), Dresden, Germany: German Cancer Research Center (DKFZ), Heidelberg, Germany; Faculty of Medicine and University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany; Helmholtz-Zentrum Dresden-Rossendorf (HZDR).
³ Else Kröner Fresenius Center for Digital Health (EKFZ), Technische Universität Dresden.
⁴ Department of Translational Surgical Oncology, National Center for Tumor Diseases (NCT)/UCC, Partner Site Dresden.
⁵ Cluster of Excellence "Centre for Tactile Internet with Human-in-the-Loop" (CeTI), Technische Universität Dresden, Dresden, Germany.

Abstract

Background: Lack of anatomy recognition represents a clinically relevant risk in abdominal surgery. Machine learning (ML) methods can help identify visible patterns and risk structures; however, their practical value remains largely unclear.

Materials and methods: Based on a novel dataset of 13 195 laparoscopic images with pixel-wise segmentations of 11 anatomical structures, we developed structure-specific segmentation models and combined models covering all anatomical structures, using two state-of-the-art model architectures (DeepLabv3 and SegFormer). Using pancreas segmentation as an example, we compared the segmentation performance of these models with that of a cohort of 28 physicians, medical students, and medical laypersons.

Results: Mean Intersection-over-Union for semantic segmentation of intra-abdominal structures ranged from 0.28 to 0.83 and from 0.23 to 0.77 for the DeepLabv3-based structure-specific and combined models, and from 0.31 to 0.85 and from 0.26 to 0.67 for the SegFormer-based structure-specific and combined models, respectively. Both the structure-specific and the combined DeepLabv3-based models are capable of near-real-time operation, while the SegFormer-based models are not. All four models outperformed at least 26 out of 28 human participants in pancreas segmentation.
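For readers unfamiliar with the metric, Intersection-over-Union (IoU) for one image is the number of pixels labelled as the structure in both the predicted and the reference mask, divided by the number of pixels labelled in either. The following is a minimal sketch in plain Python, not the authors' evaluation code. The names `iou` and `mean_iou` are hypothetical. Averaging per image and scoring two empty masks as 1.0 are assumptions that the abstract does not specify.

```python
from typing import Sequence

# A binary mask as rows of 0/1 values (1 = pixel belongs to the structure).
Mask = Sequence[Sequence[int]]


def iou(pred: Mask, target: Mask) -> float:
    """Intersection-over-Union of two equally sized binary masks."""
    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                intersection += 1
            if p or t:
                union += 1
    # Assumption: two empty masks count as perfect agreement.
    return intersection / union if union else 1.0


def mean_iou(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """Average of per-image IoU values (assumed averaging scheme)."""
    scores = [iou(p, t) for p, t in zip(preds, targets)]
    return sum(scores) / len(scores)


# Toy 2x3 masks: 2 pixels overlap, 4 pixels are labelled in either mask.
pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
print(iou(pred, target))  # 0.5
```

On this scale, 0 means no overlap and 1 means identical masks, which is how the reported ranges (for example 0.28 to 0.83) can be read.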

Conclusions: These results demonstrate that ML methods have the potential to provide relevant assistance in anatomy recognition in minimally invasive surgery in near-real-time. Future research should investigate the educational value and subsequent clinical impact of the respective assistance systems.

MeSH terms

Algorithms
Humans
Image Processing, Computer-Assisted / methods
Laparoscopy*
Machine Learning*