Label noise and self-learning label correction in cardiac abnormalities classification

Cristina Gallego Vázquez; Alexander Breuss; Oriella Gnarra; Julian Portmann; Antonio Madaffari; Giulia Da Poian

doi:10.1088/1361-6579/ac89cb

Label noise and self-learning label correction in cardiac abnormalities classification

Physiol Meas. 2022 Sep 5;43(9). doi: 10.1088/1361-6579/ac89cb.

Authors

Cristina Gallego Vázquez¹, Alexander Breuss¹, Oriella Gnarra^{1

2}, Julian Portmann³, Antonio Madaffari⁴, Giulia Da Poian¹

Affiliations

¹ Sensory-Motor Systems (SMS) Lab, Department of Health Sciences and Technology, ETH Zurich, Switzerland.
² Sleep-Wake-Epilepsy-Center, Department of Neurology, Bern University Hospital (Inselspital), Switzerland.
³ Department of Computer Science, ETH Zurich, Switzerland.
⁴ Cardiovascular Center, University Clinic for Cardiology, Bern University Hospital (Inselspital), Switzerland.

PMID: 35970176
DOI: 10.1088/1361-6579/ac89cb

Abstract

Objective. Learning to classify cardiac abnormalities requires large and high-quality labeled datasets, which is a challenge in medical applications. Small datasets from various sources are often aggregated to meet this requirement, resulting in a final dataset prone to label noise due to inter- and intra-observer variability and different expertise. It is well known that label noise can affect the performance and generalizability of the trained models. In this work, we explore the impact of label noise and self-learning label correction on the classification of cardiac abnormalities on large heterogeneous datasets of electrocardiogram (ECG) signals.Approach.A state-of-the-art self-learning multi-class label correction method for image classification is adapted to learn a multi-label classifier for electrocardiogram signals. We evaluated our performance using 5-fold cross-validation on the publicly available PhysioNet/Computing in Cardiology (CinC) 2021 Challenge data, with full and reduced sets of leads. Due to the unknown label noise in the testing set, we tested our approach on the MNIST dataset. We investigated the performance under different levels of structured label noise for both datasets.Main results.Under high levels of noise, the cross-validation results of self-learning label correction show an improvement of approximately 3% in the challenge score for the PhysioNet/CinC 2021 Challenge dataset and an improvement in accuracy of 5% and reduction of the expected calibration error of 0.03 for the MNIST dataset. We demonstrate that self-learning label correction can be used to effectively deal with the presence of unknown label noise, also when using a reduced number of ECG leads.

Keywords: ECG; classification; deep learning; label noise.

Creative Commons Attribution license.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Electrocardiography* / methods
Humans
Observer Variation
Signal-To-Noise Ratio