A deep learning approach to private data sharing of medical images using conditional generative adversarial networks (GANs)

PLoS One. 2023 Jul 6;18(7):e0280316. doi: 10.1371/journal.pone.0280316. eCollection 2023.

Abstract

Clinical data sharing can facilitate data-driven scientific research, allowing a broader range of questions to be addressed and thereby leading to greater understanding and innovation. However, sharing biomedical data can put sensitive personal information at risk. This is usually addressed by data anonymization, which is a slow and expensive process. An alternative to anonymization is construction of a synthetic dataset that behaves similar to the real clinical data but preserves patient privacy. As part of a collaboration between Novartis and the Oxford Big Data Institute, a synthetic dataset was generated based on images from COSENTYX® (secukinumab) ankylosing spondylitis (AS) clinical studies. An auxiliary classifier Generative Adversarial Network (ac-GAN) was trained to generate synthetic magnetic resonance images (MRIs) of vertebral units (VUs), conditioned on the VU location (cervical, thoracic and lumbar). Here, we present a method for generating a synthetic dataset and conduct an in-depth analysis on its properties along three key metrics: image fidelity, sample diversity and dataset privacy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Academies and Institutes
  • Benchmarking
  • Big Data
  • Deep Learning*
  • Humans
  • Image Processing, Computer-Assisted
  • Information Dissemination

Grants and funding

The study was sponsored by Novartis Pharma AG. Novartis personnel and academic advisors from Oxford Big Data Institute (BDI) designed the project.