Long-term performance assessment of fully automatic biomedical glottis segmentation at the point of care

René Groh; Stephan Dürr; Anne Schützenberger; Marion Semmler; Andreas M Kist

doi:10.1371/journal.pone.0266989

PLoS One. 2022 Sep 21;17(9):e0266989. doi: 10.1371/journal.pone.0266989. eCollection 2022.

Authors

René Groh¹, Stephan Dürr², Anne Schützenberger², Marion Semmler², Andreas M Kist¹

Affiliations

¹ Department Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Bavaria, Germany.
² Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Bavaria, Germany.

Abstract

Deep learning has had a large impact on medical image analysis and has recently been adopted for clinical use at the point of care. However, only a few long-term studies report how deep neural networks (DNNs) perform in such an environment. In this study, we measured the long-term performance of a clinically optimized DNN for laryngeal glottis segmentation. We collected video footage over two years from an AI-powered laryngeal high-speed videoendoscopy imaging system and found that the image quality of the footage is stable over time. Next, we determined the DNN segmentation performance on lossy and lossless compressed data, revealing that only 9% of recordings contain segmentation artifacts. We found that lossy and lossless compression are on par for glottis segmentation; however, lossless compression provides significantly superior image quality. Lastly, we employed continual learning strategies to continuously incorporate new data into the DNN and remove the aforementioned segmentation artifacts. With modest manual intervention, we were able to reduce these segmentation artifacts by up to 81%. We believe that our suggested deep learning-enhanced laryngeal imaging platform consistently provides clinically sound results and, together with our proposed continual learning scheme, will have a long-lasting impact on the future of laryngeal imaging.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Artifacts
Glottis / diagnostic imaging
Image Processing, Computer-Assisted / methods
Larynx* / diagnostic imaging
Neural Networks, Computer
Point-of-Care Systems*

Grants and funding

This work was funded in part by the German Research Foundation (DFG, https://www.dfg.de/) under grant no. SCHU 3441/3-2 (to AS).