Benchmarking the Impact of Noise on Deep Learning-Based Classification of Atrial Fibrillation in 12-Lead ECG

Theresa Bender; Philip Gemke; Ennio Idrobo-Avila; Henning Dathe; Dagmar Krefting; Nicolai Spicher

doi:10.3233/SHTI230321

Stud Health Technol Inform. 2023 May 18:302:977-981. doi: 10.3233/SHTI230321.

Authors

Theresa Bender¹,², Philip Gemke¹, Ennio Idrobo-Avila¹, Henning Dathe¹, Dagmar Krefting¹,², Nicolai Spicher¹,²

Affiliations

¹ Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Germany.
² DZHK (German Centre for Cardiovascular Research), partner site Göttingen, Göttingen, Germany.

PMID: 37203548

Abstract

Electrocardiography analysis is widely used in clinical applications, and Deep Learning models for classification tasks are currently a focus of research. Due to their data-driven character, they have the potential to handle signal noise efficiently, but the influence of noise on their accuracy is still unclear. Therefore, we benchmark the influence of four types of noise on the accuracy of a Deep Learning-based method for atrial fibrillation detection in 12-lead electrocardiograms. We use a subset of a publicly available dataset (PTB-XL) and use the noise-related metadata provided by human experts to assign a signal quality to each electrocardiogram. Furthermore, we compute a quantitative signal-to-noise ratio for each electrocardiogram. We analyze the accuracy of the Deep Learning model with respect to both metrics and observe that the method can robustly identify atrial fibrillation, even in cases where signals are labelled by human experts as being noisy on multiple leads. False positive and false negative rates are slightly worse for data labelled as noisy. Interestingly, data annotated as showing baseline drift noise yields an accuracy very similar to that of data without it. We conclude that the issue of processing noisy electrocardiography data can be addressed successfully by Deep Learning methods, which might not need preprocessing as many conventional methods do.
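The two quantities the abstract relies on, a per-recording signal-to-noise ratio and error rates split by signal quality, can be illustrated with a minimal sketch. The abstract does not state how the SNR was estimated or how the quality groups were encoded, so everything below is an assumption made for illustration: the function names `snr_db` and `error_rates_by_group` are hypothetical, the SNR uses the generic power-ratio definition in decibels, and it presumes an estimate of the noise-free signal is available (for example from a filter), which may differ from the authors' procedure.

```python
import numpy as np


def snr_db(clean, noisy):
    """Generic SNR in dB: 10 * log10(signal power / noise power).

    Assumption: `clean` is an estimate of the noise-free signal and
    `noisy - clean` is treated as the noise component.
    """
    clean = np.asarray(clean, dtype=float)
    noise = np.asarray(noisy, dtype=float) - clean
    signal_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    return 10.0 * np.log10(signal_power / noise_power)


def error_rates_by_group(y_true, y_pred, group):
    """Return {group: (false_positive_rate, false_negative_rate)}.

    y_true / y_pred: binary labels (1 = atrial fibrillation).
    group: one quality label per recording (hypothetical encoding,
    e.g. "clean" vs "noisy").
    """
    y_true = np.asarray(y_true, dtype=bool)
    y_pred = np.asarray(y_pred, dtype=bool)
    group = np.asarray(group)
    rates = {}
    for g in np.unique(group):
        mask = group == g
        t, p = y_true[mask], y_pred[mask]
        negatives = np.sum(~t)
        positives = np.sum(t)
        # Rates are undefined (NaN) when a group lacks that class.
        fpr = np.sum(p & ~t) / negatives if negatives else float("nan")
        fnr = np.sum(~p & t) / positives if positives else float("nan")
        rates[str(g)] = (float(fpr), float(fnr))
    return rates
```

Comparing the per-group tuples returned by `error_rates_by_group` is one simple way to check whether recordings flagged as noisy are misclassified more often than clean ones.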

Keywords: Atrial Fibrillation; Deep Learning; Electrocardiogram; Noise.

MeSH terms

Algorithms
Atrial Fibrillation* / diagnosis
Benchmarking
Deep Learning*
Electrocardiography / methods
Humans
Signal-To-Noise Ratio