Automatic task recognition in a flexible endoscopy benchtop trainer with semi-supervised learning

Int J Comput Assist Radiol Surg. 2020 Sep;15(9):1585-1595. doi: 10.1007/s11548-020-02208-w. Epub 2020 Jun 26.

Abstract

Purpose: Inexpensive benchtop training systems offer significant advantages to meet the increasing demand of training surgeons and gastroenterologists in flexible endoscopy. Established scoring systems exist, based on task duration and mistake evaluation. However, they require trained human raters, which limits broad and low-cost adoption. There is an unmet and important need to automate rating with machine learning.

Method: We present a general and robust approach for recognizing training tasks from endoscopic training video, which consequently automates task duration computation. Our main technical novelty is to show the performance of state-of-the-art CNN-based approaches can be improved significantly with a novel semi-supervised learning approach, using both labelled and unlabelled videos. In the latter case, we assume only the task execution order is known a priori.

Results: Two video datasets are presented: the first has 19 videos recorded in examination conditions, where the participants complete their tasks in predetermined order. The second has 17 h of videos recorded in self-assessment conditions, where participants complete one or more tasks in any order. For the first dataset, we obtain a mean task duration estimation error of 3.65 s, with a mean task duration of 159 s ([Formula: see text] relative error). For the second dataset, we obtain a mean task duration estimation error of 3.67 s. We reduce an average of 5.63% in error to 3.67% thanks to our semi-supervised learning approach.

Conclusion: This work is the first significant step forward to automate rating of flexible endoscopy students using a low-cost benchtop trainer. Thanks to our semi-supervised learning approach, we can scale easily to much larger unlabelled training datasets. The approach can also be used for other phase recognition tasks.

Keywords: Benchtop simulator; Education; Flexible endoscopy; Phase recognition; Semi-supervised learning; Skill.

MeSH terms

  • Algorithms
  • Diagnosis, Computer-Assisted
  • Endoscopes*
  • Endoscopy / education*
  • Equipment Design
  • Gastroenterology / education*
  • Gastroenterology / instrumentation
  • Humans
  • Internship and Residency
  • Machine Learning*
  • Pattern Recognition, Automated*
  • Reproducibility of Results
  • Supervised Machine Learning*
  • Task Performance and Analysis
  • Video Recording