Deep Generative Models: The winning key for large and easily accessible ECG datasets?

Comput Biol Med. 2023 Dec:167:107655. doi: 10.1016/j.compbiomed.2023.107655. Epub 2023 Nov 2.

Abstract

Large high-quality datasets are essential for building powerful artificial intelligence (AI) algorithms capable of supporting advancement in cardiac clinical research. However, researchers working with electrocardiogram (ECG) signals struggle to get access and/or to build one. The aim of the present work is to shed light on a potential solution to address the lack of large and easily accessible ECG datasets. Firstly, the main causes of such a lack are identified and examined. Afterward, the potentials and limitations of cardiac data generation via deep generative models (DGMs) are deeply analyzed. These very promising algorithms have been found capable not only of generating large quantities of ECG signals but also of supporting data anonymization processes, to simplify data sharing while respecting patients' privacy. Their application could help research progress and cooperation in the name of open science. However several aspects, such as a standardized synthetic data quality evaluation and algorithm stability, need to be further explored.

Keywords: Anonymization; Data augmentation; Data scarcity; Data sharing; Deep generative models; Diffusion models; ECG synthesis; GAN; Open science; Variational autoencoders.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Artificial Intelligence*
  • Data Accuracy
  • Electrocardiography*
  • Heart
  • Humans