Federated Learning: A Cross-Institutional Feasibility Study of Deep Learning Based Intracranial Tumor Delineation Framework for Stereotactic Radiosurgery

J Magn Reson Imaging. 2024 Jun;59(6):1967-1975. doi: 10.1002/jmri.28950. Epub 2023 Aug 12.

Abstract

Background: Deep learning-based segmentation algorithms usually required large or multi-institute data sets to improve the performance and ability of generalization. However, protecting patient privacy is a key concern in the multi-institutional studies when conventional centralized learning (CL) is used.

Purpose: To explores the feasibility of a proposed lesion delineation for stereotactic radiosurgery (SRS) scheme for federated learning (FL), which can solve decentralization and privacy protection concerns.

Study type: Retrospective.

Subjects: 506 and 118 vestibular schwannoma patients aged 15-88 and 22-85 from two institutes, respectively; 1069 and 256 meningioma patients aged 12-91 and 23-85, respectively; 574 and 705 brain metastasis patients aged 26-92 and 28-89, respectively.

Field strength/sequence: 1.5T, spin-echo, and gradient-echo [Correction added after first online publication on 21 August 2023. Field Strength has been changed to "1.5T" from "5T" in this sentence.].

Assessment: The proposed lesion delineation method was integrated into an FL framework, and CL models were established as the baseline. The effect of image standardization strategies was also explored. The dice coefficient was used to evaluate the segmentation between the predicted delineation and the ground truth, which was manual delineated by neurosurgeons and a neuroradiologist.

Statistical tests: The paired t-test was applied to compare the mean for the evaluated dice scores (p < 0.05).

Results: FL performed the comparable mean dice coefficient to CL for the testing set of Taipei Veterans General Hospital regardless of standardization and parameter; for the Taichung Veterans General Hospital data, CL significantly (p < 0.05) outperformed FL while using bi-parameter, but comparable results while using single-parameter. For the non-SRS data, FL achieved the comparable applicability to CL with mean dice 0.78 versus 0.78 (without standardization), and outperformed to the baseline models of two institutes.

Data conclusion: The proposed lesion delineation successfully implemented into an FL framework. The FL models were applicable on SRS data of each participating institute, and the FL exhibited comparable mean dice coefficient to CL on non-SRS dataset. Standardization strategies would be recommended when FL is used.

Level of evidence: 4 TECHNICAL EFFICACY: Stage 1.

Keywords: convolutional neural network; federated learning; magnetic resonance imaging; meningioma; metastasis; vestibular schwannoma.

Publication types

  • Research Support, Non-U.S. Gov't
  • Multicenter Study

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Algorithms
  • Brain Neoplasms* / diagnostic imaging
  • Brain Neoplasms* / radiotherapy
  • Child
  • Deep Learning*
  • Feasibility Studies*
  • Female
  • Humans
  • Image Processing, Computer-Assisted / methods
  • Magnetic Resonance Imaging / methods
  • Male
  • Meningioma / diagnostic imaging
  • Meningioma / radiotherapy
  • Meningioma / surgery
  • Middle Aged
  • Neuroma, Acoustic / diagnostic imaging
  • Neuroma, Acoustic / radiotherapy
  • Neuroma, Acoustic / surgery
  • Radiosurgery* / methods
  • Retrospective Studies
  • Young Adult