Prospectively-validated deep learning model for segmenting swallowing and chewing structures in CT

Aditi Iyer; Maria Thor; Ifeanyirochukwu Onochie; Jennifer Hesse; Kaveh Zakeri; Eve LoCastro; Jue Jiang; Harini Veeraraghavan; Sharif Elguindi; Nancy Y Lee; Joseph O Deasy; Aditya P Apte

doi:10.1088/1361-6560/ac4000

Prospectively-validated deep learning model for segmenting swallowing and chewing structures in CT

Phys Med Biol. 2022 Jan 17;67(2):10.1088/1361-6560/ac4000. doi: 10.1088/1361-6560/ac4000.

Affiliations

¹ Department of Medical Physics, Memorial Sloan Kettering Cancer Center, New York, United States of America.
² Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, United States of America.

Abstract

Objective.Delineating swallowing and chewing structures aids in radiotherapy (RT) treatment planning to limit dysphagia, trismus, and speech dysfunction. We aim to develop an accurate and efficient method to automate this process.Approach.CT scans of 242 head and neck (H&N) cancer patients acquired from 2004 to 2009 at our institution were used to develop auto-segmentation models for the masseters, medial pterygoids, larynx, and pharyngeal constrictor muscle using DeepLabV3+. A cascaded framework was used, wherein models were trained sequentially to spatially constrain each structure group based on prior segmentations. Additionally, an ensemble of models, combining contextual information from axial, coronal, and sagittal views was used to improve segmentation accuracy. Prospective evaluation was conducted by measuring the amount of manual editing required in 91 H&N CT scans acquired February-May 2021.Main results. Medians and inter-quartile ranges of Dice similarity coefficients (DSC) computed on the retrospective testing set (N = 24) were 0.87 (0.85-0.89) for the masseters, 0.80 (0.79-0.81) for the medial pterygoids, 0.81 (0.79-0.84) for the larynx, and 0.69 (0.67-0.71) for the constrictor. Auto-segmentations, when compared to two sets of manual segmentations in 10 randomly selected scans, showed better agreement (DSC) with each observer than inter-observer DSC. Prospective analysis showed most manual modifications needed for clinical use were minor, suggesting auto-contouring could increase clinical efficiency. Trained segmentation models are available for research use upon request viahttps://github.com/cerr/CERR/wiki/Auto-Segmentation-models.Significance.We developed deep learning-based auto-segmentation models for swallowing and chewing structures in CT and demonstrated its potential for use in treatment planning to limit complications post-RT. To the best of our knowledge, this is the only prospectively-validated deep learning-based model for segmenting chewing and swallowing structures in CT. Segmentation models have been made open-source to facilitate reproducibility and multi-institutional research.

Keywords: auto-segmentation; deep learning; dysphagia; radiation therapy; swallowing and chewing structures; trismus.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Deep Learning*
Deglutition
Head and Neck Neoplasms*
Humans
Mastication
Organs at Risk
Radiotherapy Planning, Computer-Assisted / methods
Reproducibility of Results
Retrospective Studies
Tomography, X-Ray Computed / methods

Prospectively-validated deep learning model for segmenting swallowing and chewing structures in CT

Authors

Affiliations

Abstract

Publication types

MeSH terms

Grants and funding