Evaluation of deep learning methods for parotid gland segmentation from CT images

Annika Hänsch; Michael Schwier; Tobias Gass; Tomasz Morgas; Benjamin Haas; Volker Dicken; Hans Meine; Jan Klein; Horst K Hahn

doi:10.1117/1.JMI.6.1.011005

Evaluation of deep learning methods for parotid gland segmentation from CT images

J Med Imaging (Bellingham). 2019 Jan;6(1):011005. doi: 10.1117/1.JMI.6.1.011005. Epub 2018 Oct 1.

Authors

Annika Hänsch¹, Michael Schwier¹, Tobias Gass², Tomasz Morgas³, Benjamin Haas², Volker Dicken¹, Hans Meine¹, Jan Klein¹, Horst K Hahn¹

Affiliations

¹ Fraunhofer MEVIS, Bremen, Germany.
² Varian Medical Systems Imaging Laboratory GmbH, Baden-Dättwil, Switzerland.
³ Varian Medical Systems, Las Vegas, Nevada, United States.

Abstract

The segmentation of organs at risk is a crucial and time-consuming step in radiotherapy planning. Good automatic methods can significantly reduce the time clinicians have to spend on this task. Due to its variability in shape and low contrast to surrounding structures, segmenting the parotid gland is challenging. Motivated by the recent success of deep learning, we study the use of two-dimensional (2-D), 2-D ensemble, and three-dimensional (3-D) U-Nets for segmentation. The mean Dice similarity to ground truth is $\sim 0.83$ for all three models. A patch-based approach for class balancing seems promising for false-positive reduction. The 2-D ensemble and 3-D U-Net are applied to the test data of the 2015 MICCAI challenge on head and neck autosegmentation. Both deep learning methods generalize well onto independent data (Dice 0.865 and 0.88) and are superior to a selection of model- and atlas-based methods with respect to the Dice coefficient. Since appropriate reference annotations are essential for training but often difficult and expensive to obtain, it is important to know how many samples are needed for training. We evaluate the performance after training with different-sized training sets and observe no significant increase in the Dice coefficient for more than 250 training cases.

Keywords: autocontouring; deep learning; head and neck; radiotherapy planning; segmentation.