Can Deep Learning Replace Gadolinium in Neuro-Oncology?: A Reader Study

Invest Radiol. 2022 Feb 1;57(2):99-107. doi: 10.1097/RLI.0000000000000811.

Abstract

Materials and methods: This monocentric retrospective study leveraged 200 multiparametric brain MRIs acquired between November 2019 and February 2020 at Gustave Roussy Cancer Campus (Villejuif, France). A total of 145 patients were included: 107 formed the training sample (55 ± 14 years, 58 women) and 38 the separate test sample (62 ± 12 years, 22 women). Patients had glioma, brain metastases, meningioma, or no enhancing lesion. T1, T2-FLAIR, diffusion-weighted imaging, low-dose, and standard-dose postcontrast T1 sequences were acquired. A deep network was trained to process the precontrast and low-dose sequences to predict "virtual" surrogate images for contrast-enhanced T1. Once trained, the deep learning method was evaluated on the test sample. The discrepancies between the predicted virtual images and the standard-dose MRIs were qualitatively and quantitatively evaluated using both automated voxel-wise metrics and a reader study, where 2 radiologists graded image qualities and marked all visible enhancing lesions.

Results: The automated analysis of the test brain MRIs computed a structural similarity index of 87.1% ± 4.8% between the predicted virtual sequences and the reference contrast-enhanced T1 MRIs, a peak signal-to-noise ratio of 31.6 ± 2.0 dB, and an area under the curve of 96.4% ± 3.1%. At Youden's operating point, the voxel-wise sensitivity (SE) and specificity were 96.4% and 94.8%, respectively. The reader study found that virtual images were preferred to standard-dose MRI in terms of image quality (P = 0.008). A total of 91 reference lesions were identified in the 38 test T1 sequences enhanced with full dose of contrast agent. On average across readers, the brain lesion SE of the virtual images was 83% for lesions larger than 10 mm (n = 42), and the associated false detection rate was 0.08 lesion/patient. The corresponding positive predictive value of detected lesions was 92%, and the F1 score was 88%. Lesion detection performance, however, dropped when smaller lesions were included: average SE was 67% for lesions larger than 5 mm (n = 74), and 56% with all lesions included regardless of their size. The false detection rate remained below 0.50 lesion/patient in all cases, and the positive predictive value remained above 73%. The composite F1 score was 63% at worst.

Conclusions: The proposed deep learning method for virtual contrast-enhanced T1 brain MRI prediction showed very high quantitative performance when evaluated with standard voxel-wise metrics. The reader study demonstrated that, for lesions larger than 10 mm, good detection performance could be maintained despite a 4-fold division in contrast agent usage, unveiling a promising avenue for reducing the gadolinium exposure of returning patients. Small lesions proved, however, difficult to handle for the deep network, showing that full-dose injections remain essential for accurate first-line diagnosis in neuro-oncology.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Brain Neoplasms* / diagnostic imaging
  • Contrast Media
  • Deep Learning*
  • Female
  • Gadolinium
  • Humans
  • Magnetic Resonance Imaging / methods
  • Retrospective Studies

Substances

  • Contrast Media
  • Gadolinium