Quality assurance for automatically generated contours with additional deep learning

Lars Johannes Isaksson; Paul Summers; Abhir Bhalerao; Sara Gandini; Sara Raimondi; Matteo Pepa; Mattia Zaffaroni; Giulia Corrao; Giovanni Carlo Mazzola; Marco Rotondi; Giuliana Lo Presti; Zaharudin Haron; Sara Alessi; Paola Pricolo; Francesco Alessandro Mistretta; Stefano Luzzago; Federica Cattani; Gennaro Musi; Ottavio De Cobelli; Marta Cremonesi; Roberto Orecchia; Giulia Marvaso; Giuseppe Petralia; Barbara Alicja Jereczek-Fossa

doi:10.1186/s13244-022-01276-7

Insights Imaging. 2022 Aug 17;13(1):137. doi: 10.1186/s13244-022-01276-7.

Authors

Lars Johannes Isaksson¹, Paul Summers², Abhir Bhalerao³, Sara Gandini⁴, Sara Raimondi⁴, Matteo Pepa⁵, Mattia Zaffaroni⁵, Giulia Corrao⁵,⁶, Giovanni Carlo Mazzola⁵,⁶, Marco Rotondi⁵,⁶, Giuliana Lo Presti⁴, Zaharudin Haron⁷, Sara Alessi², Paola Pricolo², Francesco Alessandro Mistretta⁸, Stefano Luzzago⁸, Federica Cattani⁹, Gennaro Musi⁶,⁸, Ottavio De Cobelli⁶,⁸, Marta Cremonesi¹⁰, Roberto Orecchia¹¹, Giulia Marvaso⁵,⁶, Giuseppe Petralia⁶,¹², Barbara Alicja Jereczek-Fossa⁵,⁶

Affiliations

¹ Division of Radiation Oncology, IEO European Institute of Oncology IRCCS, Milan, Italy. larsjohannes.isaksson@ieo.it.
² Division of Radiology, IEO European Institute of Oncology IRCCS, Milan, Italy.
³ Department of Computer Science, University of Warwick, Coventry, Warwick, CV4 7AL, UK.
⁴ Molecular and Pharmaco-Epidemiology Unit, Department of Experimental Oncology, IEO European Institute of Oncology IRCCS, Milan, Italy.
⁵ Division of Radiation Oncology, IEO European Institute of Oncology IRCCS, Milan, Italy.
⁶ Department of Oncology and Hemato-Oncology, University of Milan, Milan, Italy.
⁷ Radiology Department, National Cancer Institute, Putrajaya, Malaysia.
⁸ Division of Urology, IEO European Institute of Oncology IRCCS, Milan, Italy.
⁹ Medical Physics Unit, IEO European Institute of Oncology IRCCS, Milan, Italy.
¹⁰ Radiation Research Unit, IEO European Institute of Oncology IRCCS, Milan, Italy.
¹¹ Scientific Direction, IEO European Institute of Oncology IRCCS, Milan, Italy.
¹² Precision Imaging and Research Unit, Department of Medical Imaging and Radiation Sciences, IEO European Institute of Oncology IRCCS, Milan, Italy.

Abstract

Objective: Deploying an automatic segmentation model in practice should require rigorous quality assurance (QA) and continuous monitoring of the model's use and performance, particularly in high-stakes scenarios such as healthcare. Currently, however, tools to assist with QA for such models are not available to AI researchers. In this work, we build a deep learning model that estimates the quality of automatically generated contours.

Methods: The model was trained to predict segmentation quality by outputting an estimate of the Dice similarity coefficient given an image-contour pair as input. Our dataset contained 60 axial T2-weighted MRI images of prostates with ground truth segmentations along with 80 automatically generated segmentation masks. The model was a 3D version of the EfficientDet architecture with a custom regression head. For validation, we used fivefold cross-validation. To counteract the limitation of the small dataset, we used an extensive data augmentation scheme capable of producing a virtually unlimited number of training samples from a single ground truth label mask. In addition, we compared the results against a baseline model that uses only clinical variables for its predictions.
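For readers unfamiliar with the regression target, the following is a minimal sketch of how a Dice similarity coefficient can be computed between a candidate contour and a ground truth mask, and how a perturbed copy of a ground truth mask could serve as a labeled training sample. It is an illustration only, not the authors' augmentation scheme: the function names, the simple voxel shift used as the perturbation, and the convention for two empty masks are all assumptions.

```python
import numpy as np


def dice_coefficient(pred, truth):
    """Dice similarity coefficient between two binary masks of equal shape."""
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    total = pred.sum() + truth.sum()
    if total == 0:
        # Both masks empty: treated here as perfect agreement (assumed convention).
        return 1.0
    return 2.0 * np.logical_and(pred, truth).sum() / total


def perturbed_training_pair(truth, shift, axis=0):
    """Hypothetical augmentation: shift the ground truth mask by `shift`
    voxels to simulate an imperfect contour, and return the shifted mask
    together with its Dice score against the original mask."""
    candidate = np.roll(truth, shift, axis=axis)
    return candidate, dice_coefficient(candidate, truth)


# Toy 3D example: a 4x4x4 cube inside an 8x8x8 volume.
truth = np.zeros((8, 8, 8), dtype=bool)
truth[2:6, 2:6, 2:6] = True
candidate, label = perturbed_training_pair(truth, shift=1)
```

In this toy setup the label is the regression target that a quality-estimation model would be trained to predict from the image and the candidate mask; varying the perturbation yields many labeled samples from a single ground truth mask.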

Results: Our model achieved a mean absolute error of 0.020 ± 0.026 (2.2% mean percentage error) in estimating the Dice score, with a rank correlation of 0.42. Furthermore, the model managed to correctly identify incorrect segmentations (defined in terms of acceptable/unacceptable) 99.6% of the time.
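The reported metrics can be illustrated with a short sketch. The definitions below are assumptions, since the abstract does not specify them: the percentage error is taken as the absolute error relative to the true Dice score, the rank correlation is computed as a Spearman-style correlation without tie correction, and the acceptability threshold of 0.85 is a hypothetical value chosen only for the example.

```python
import numpy as np


def ordinal_ranks(x):
    """Ranks from 0 to n-1; ties are broken by position (no tie correction)."""
    order = np.argsort(x, kind="stable")
    ranks = np.empty(len(x), dtype=float)
    ranks[order] = np.arange(len(x))
    return ranks


def evaluate_quality_estimates(pred, true, threshold=0.85):
    """Compare predicted and true Dice scores.

    `threshold` is a hypothetical cutoff separating acceptable from
    unacceptable segmentations; it is not taken from the paper.
    """
    pred = np.asarray(pred, dtype=float)
    true = np.asarray(true, dtype=float)
    abs_err = np.abs(pred - true)
    return {
        "mae": abs_err.mean(),
        "mae_sd": abs_err.std(),
        # Assumed definition: absolute error relative to the true score, in percent.
        "mean_pct_error": 100.0 * np.mean(abs_err / true),
        # Pearson correlation of the ranks (Spearman-style).
        "rank_corr": np.corrcoef(ordinal_ranks(pred), ordinal_ranks(true))[0, 1],
        # Fraction of cases where prediction and truth fall on the same side of the cutoff.
        "class_agreement": np.mean((pred >= threshold) == (true >= threshold)),
    }


# Made-up scores for illustration only.
true_dice = [0.90, 0.80, 0.95, 0.70]
pred_dice = [0.88, 0.83, 0.93, 0.75]
metrics = evaluate_quality_estimates(pred_dice, true_dice)
```

Note that a low mean absolute error and a modest rank correlation can coexist when most true Dice scores lie in a narrow range, because small errors are then enough to reorder cases.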

Conclusion: We believe that the trained model can be used alongside automatic segmentation tools to ensure quality and thus allow intervention to prevent undesired segmentation behavior.

Keywords: Confidence calibration; Diagnostic imaging; Magnetic resonance imaging; Prostate; Quality assurance (Health care).