Implementable Deep Learning for Multi-sequence Proton MRI Lung Segmentation: A Multi-center, Multi-vendor, and Multi-disease Study

Joshua R Astley; Alberto M Biancardi; Paul J C Hughes; Helen Marshall; Guilhem J Collier; Ho-Fung Chan; Laura C Saunders; Laurie J Smith; Martin L Brook; Roger Thompson; Sarah Rowland-Jones; Sarah Skeoch; Stephen M Bianchi; Matthew Q Hatton; Najib M Rahman; Ling-Pei Ho; Chris E Brightling; Louise V Wain; Amisha Singapuri; Rachael A Evans; Alastair J Moss; Gerry P McCann; Stefan Neubauer; Betty Raman; C-MORE/PHOSP-COVID Collaborative Group; Jim M Wild; Bilal A Tahir

doi:10.1002/jmri.28643

Implementable Deep Learning for Multi-sequence Proton MRI Lung Segmentation: A Multi-center, Multi-vendor, and Multi-disease Study

J Magn Reson Imaging. 2023 Oct;58(4):1030-1044. doi: 10.1002/jmri.28643. Epub 2023 Feb 17.

Authors

Joshua R Astley^{1

2}, Alberto M Biancardi¹, Paul J C Hughes¹, Helen Marshall¹, Guilhem J Collier¹, Ho-Fung Chan¹, Laura C Saunders¹, Laurie J Smith¹, Martin L Brook¹, Roger Thompson³, Sarah Rowland-Jones³, Sarah Skeoch^{4

5}, Stephen M Bianchi³, Matthew Q Hatton³, Najib M Rahman⁶, Ling-Pei Ho⁷, Chris E Brightling⁸, Louise V Wain^{8

9}, Amisha Singapuri⁸, Rachael A Evans¹⁰, Alastair J Moss^{8

11}, Gerry P McCann^{8

11}, Stefan Neubauer⁶, Betty Raman⁶; C-MORE/PHOSP-COVID Collaborative Group; Jim M Wild^{1

12}, Bilal A Tahir^{1

2

12}

Affiliations

¹ POLARIS, Department of Infection, Immunity & Cardiovascular Disease, The University of Sheffield, Sheffield, UK.
² Department of Oncology and Metabolism, The University of Sheffield, Sheffield, UK.
³ Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, UK.
⁴ Royal National Hospital for Rheumatic Diseases, Royal United Hospital NHS Foundation Trust, Bath, UK.
⁵ Arthritis Research UK Centre for Epidemiology, Division of Musculoskeletal and Dermatological Sciences, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Sciences Centre, Manchester, UK.
⁶ Division of Cardiovascular Medicine, Radcliffe Department of Medicine, National Institute for Health Research (NIHR) Oxford Biomedical Research Centre (BRC), University of Oxford, Oxford, UK.
⁷ MRC Human Immunology Unit, University of Oxford, Oxford, UK.
⁸ The Institute for Lung Health, NIHR Leicester Biomedical Research Centre, University of Leicester, Leicester, UK.
⁹ Department of Health sciences, University of Leicester, Leicester, UK.
¹⁰ University Hospitals of Leicester NHS Trust, University of Leicester, Leicester, UK.
¹¹ Department of Cardiovascular Sciences, University of Leicester, Leicester, UK.
¹² Insigneo Institute for In Silico Medicine, The University of Sheffield, Sheffield, UK.

Abstract

Background: Recently, deep learning via convolutional neural networks (CNNs) has largely superseded conventional methods for proton (¹ H)-MRI lung segmentation. However, previous deep learning studies have utilized single-center data and limited acquisition parameters.

Purpose: Develop a generalizable CNN for lung segmentation in ¹ H-MRI, robust to pathology, acquisition protocol, vendor, and center.

Study type: Retrospective.

Population: A total of 809 ¹ H-MRI scans from 258 participants with various pulmonary pathologies (median age (range): 57 (6-85); 42% females) and 31 healthy participants (median age (range): 34 (23-76); 34% females) that were split into training (593 scans (74%); 157 participants (55%)), testing (50 scans (6%); 50 participants (17%)) and external validation (164 scans (20%); 82 participants (28%)) sets.

Field strength/sequence: 1.5-T and 3-T/3D spoiled-gradient recalled and ultrashort echo-time ¹ H-MRI.

Assessment: 2D and 3D CNNs, trained on single-center, multi-sequence data, and the conventional spatial fuzzy c-means (SFCM) method were compared to manually delineated expert segmentations. Each method was validated on external data originating from several centers. Dice similarity coefficient (DSC), average boundary Hausdorff distance (Average HD), and relative error (XOR) metrics to assess segmentation performance.

Statistical tests: Kruskal-Wallis tests assessed significances of differences between acquisitions in the testing set. Friedman tests with post hoc multiple comparisons assessed differences between the 2D CNN, 3D CNN, and SFCM. Bland-Altman analyses assessed agreement with manually derived lung volumes. A P value of <0.05 was considered statistically significant.

Results: The 3D CNN significantly outperformed its 2D analog and SFCM, yielding a median (range) DSC of 0.961 (0.880-0.987), Average HD of 1.63 mm (0.65-5.45) and XOR of 0.079 (0.025-0.240) on the testing set and a DSC of 0.973 (0.866-0.987), Average HD of 1.11 mm (0.47-8.13) and XOR of 0.054 (0.026-0.255) on external validation data.

Data conclusion: The 3D CNN generated accurate ¹ H-MRI lung segmentations on a heterogenous dataset, demonstrating robustness to disease pathology, sequence, vendor, and center.

Evidence level: 4.

Technical efficacy: Stage 1.

Keywords: CNN; deep learning; lung; segmentation.

Publication types

Multicenter Study
Research Support, Non-U.S. Gov't

MeSH terms

Deep Learning*
Female
Humans
Image Processing, Computer-Assisted / methods
Lung / diagnostic imaging
Magnetic Resonance Imaging / methods
Male
Protons
Retrospective Studies

Substances

Protons

Abstract

Publication types

MeSH terms

Substances

Grants and funding