A deep learning approach to private data sharing of medical images using conditional generative adversarial networks (GANs)

Hanxi Sun; Jason Plawinski; Sajanth Subramaniam; Amir Jamaludin; Timor Kadir; Aimee Readie; Gregory Ligozio; David Ohlssen; Mark Baillie; Thibaud Coroller

doi:10.1371/journal.pone.0280316

A deep learning approach to private data sharing of medical images using conditional generative adversarial networks (GANs)

PLoS One. 2023 Jul 6;18(7):e0280316. doi: 10.1371/journal.pone.0280316. eCollection 2023.

Authors

Affiliations

¹ Department of Statistics, Purdue University, West Lafayette, IN, United States of America.
² Novartis Pharmaceutical Corporation, East Hanover, New Jersey, United States of America.
³ Oxford Big Data Institute, Oxford, United Kingdom.
⁴ Plexalis Ltd, Oxford, United Kingdom.

Abstract

Clinical data sharing can facilitate data-driven scientific research, allowing a broader range of questions to be addressed and thereby leading to greater understanding and innovation. However, sharing biomedical data can put sensitive personal information at risk. This is usually addressed by data anonymization, which is a slow and expensive process. An alternative to anonymization is construction of a synthetic dataset that behaves similar to the real clinical data but preserves patient privacy. As part of a collaboration between Novartis and the Oxford Big Data Institute, a synthetic dataset was generated based on images from COSENTYX® (secukinumab) ankylosing spondylitis (AS) clinical studies. An auxiliary classifier Generative Adversarial Network (ac-GAN) was trained to generate synthetic magnetic resonance images (MRIs) of vertebral units (VUs), conditioned on the VU location (cervical, thoracic and lumbar). Here, we present a method for generating a synthetic dataset and conduct an in-depth analysis on its properties along three key metrics: image fidelity, sample diversity and dataset privacy.

Copyright: © 2023 Sun et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Academies and Institutes
Benchmarking
Big Data
Deep Learning*
Humans
Image Processing, Computer-Assisted
Information Dissemination

Grants and funding

The study was sponsored by Novartis Pharma AG. Novartis personnel and academic advisors from Oxford Big Data Institute (BDI) designed the project.