Effect of dataset size, image quality, and image type on deep learning-based automatic prostate segmentation in 3D ultrasound

Nathan Orlando; Igor Gyacskov; Derek J Gillies; Fumin Guo; Cesare Romagnoli; David D'Souza; Derek W Cool; Douglas A Hoover; Aaron Fenster

doi:10.1088/1361-6560/ac5a93

Effect of dataset size, image quality, and image type on deep learning-based automatic prostate segmentation in 3D ultrasound

Phys Med Biol. 2022 Mar 29;67(7). doi: 10.1088/1361-6560/ac5a93.

Authors

Nathan Orlando^{1

2}, Igor Gyacskov², Derek J Gillies³, Fumin Guo⁴, Cesare Romagnoli^{3

5}, David D'Souza^{3

6}, Derek W Cool^{3

5}, Douglas A Hoover^{1

3

6}, Aaron Fenster^{1

2

5

6}

Affiliations

¹ Department of Medical Biophysics, Western University, London, Ontario N6A 3K7, Canada.
² Robarts Research Institute, Western University, London, Ontario N6A 3K7, Canada.
³ London Health Sciences Centre, London, Ontario N6A 5W9, Canada.
⁴ Department of Medical Biophysics, University of Toronto, Toronto, Ontario M4N 3M5, Canada.
⁵ Department of Medical Imaging, Western University, London, Ontario N6A 3K7, Canada.
⁶ Department of Oncology, Western University, London, Ontario N6A 3K7, Canada.

PMID: 35240585
DOI: 10.1088/1361-6560/ac5a93

Abstract

Three-dimensional (3D) transrectal ultrasound (TRUS) is utilized in prostate cancer diagnosis and treatment, necessitating time-consuming manual prostate segmentation. We have previously developed an automatic 3D prostate segmentation algorithm involving deep learning prediction on radially sampled 2D images followed by 3D reconstruction, trained on a large, clinically diverse dataset with variable image quality. As large clinical datasets are rare, widespread adoption of automatic segmentation could be facilitated with efficient 2D-based approaches and the development of an image quality grading method. The complete training dataset of 6761 2D images, resliced from 206 3D TRUS volumes acquired using end-fire and side-fire acquisition methods, was split to train two separate networks using either end-fire or side-fire images. Split datasets were reduced to 1000, 500, 250, and 100 2D images. For deep learning prediction, modified U-Net and U-Net++ architectures were implemented and compared using an unseen test dataset of 40 3D TRUS volumes. A 3D TRUS image quality grading scale with three factors (acquisition quality, artifact severity, and boundary visibility) was developed to assess the impact on segmentation performance. For the complete training dataset, U-Net and U-Net++ networks demonstrated equivalent performance, but when trained using split end-fire/side-fire datasets, U-Net++ significantly outperformed the U-Net. Compared to the complete training datasets, U-Net++ trained using reduced-size end-fire and side-fire datasets demonstrated equivalent performance down to 500 training images. For this dataset, image quality had no impact on segmentation performance for end-fire images but did have a significant effect for side-fire images, with boundary visibility having the largest impact. Our algorithm provided fast (<1.5 s) and accurate 3D segmentations across clinically diverse images, demonstrating generalizability and efficiency when employed on smaller datasets, supporting the potential for widespread use, even when data is scarce. The development of an image quality grading scale provides a quantitative tool for assessing segmentation performance.

Keywords: 3D ultrasound prostate segmentation; biopsy; brachytherapy; deep learning; image quality; prostate cancer; small dataset.

Creative Commons Attribution license.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Deep Learning*
Humans
Male
Pelvis
Prostate / diagnostic imaging
Prostatic Neoplasms* / diagnostic imaging
Ultrasonography

Grants and funding

CIHR/Canada