Fully Automatic Assessment of Background Parenchymal Enhancement on Breast MRI Using Machine-Learning Models

Yoonho Nam; Ga Eun Park; Junghwa Kang; Sung Hun Kim

doi:10.1002/jmri.27429

Fully Automatic Assessment of Background Parenchymal Enhancement on Breast MRI Using Machine-Learning Models

J Magn Reson Imaging. 2021 Mar;53(3):818-826. doi: 10.1002/jmri.27429. Epub 2020 Nov 20.

Authors

Yoonho Nam^{1

2}, Ga Eun Park¹, Junghwa Kang², Sung Hun Kim¹

Affiliations

¹ Department of Radiology, Seoul St. Mary's Hospital, College of Medicine, The Catholic University of Korea, Seoul, Republic of Korea.
² Division of Biomedical Engineering, Hankuk University of Foreign Studies, Yongin, Republic of Korea.

PMID: 33219624
DOI: 10.1002/jmri.27429

Abstract

Background: Automated measurement and classification models with objectivity and reproducibility are required for accurate evaluation of the breast cancer risk of fibroglandular tissue (FGT) and background parenchymal enhancement (BPE).

Purpose: To develop and evaluate a machine-learning algorithm for breast FGT segmentation and BPE classification.

Study type: Retrospective.

Population: A total of 794 patients with breast cancer, 594 patients assigned to the development set, and 200 patients to the test set.

Field strength/sequence: 3T and 1.5T; T₂ -weighted, fat-saturated T₁ -weighted (T₁ W) with dynamic contrast enhancement (DCE).

Assessment: Manual segmentation was performed for the whole breast and FGT regions in the contralateral breast. The BPE region was determined by thresholding using the subtraction of the pre- and postcontrast T₁ W images and the segmented FGT mask. Two radiologists independently assessed the categories of FGT and BPE. A deep-learning-based algorithm was designed to segment and measure the volume of whole breast and FGT and classify the grade of BPE.

Statistical tests: Dice similarity coefficients (DSC) and Spearman correlation analysis were used to compare the volumes from the manual and deep-learning-based segmentations. Kappa statistics were used for agreement analysis. Comparison of area under the receiver operating characteristic (ROC) curves (AUC) and F1 scores were calculated to evaluate the performance of BPE classification.

Results: The mean (±SD) DSC for manual and deep-learning segmentations was 0.85 ± 0.11. The correlation coefficient for FGT volume from manual- and deep-learning-based segmentations was 0.93. Overall accuracy of manual segmentation and deep-learning segmentation in BPE classification task was 66% and 67%, respectively. For binary categorization of BPE grade (minimal/mild vs. moderate/marked), overall accuracy increased to 91.5% in manual segmentation and 90.5% in deep-learning segmentation; the AUC was 0.93 in both methods.

Data conclusion: This deep-learning-based algorithm can provide reliable segmentation and classification results for BPE.

Level of evidence: 3 TECHNICAL EFFICACY STAGE: 2.

Keywords: BPE; MRI; algorithms; breast; deep learning; machine learning.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Breast Neoplasms* / diagnostic imaging
Breast* / diagnostic imaging
Humans
Machine Learning
Magnetic Resonance Imaging
Reproducibility of Results
Retrospective Studies