Improving generalization performance of electrocardiogram classification models

Hyeongrok Han; Seongjae Park; Seonwoo Min; Eunji Kim; HyunGi Kim; Sangha Park; Jin-Kook Kim; Junsang Park; Junho An; Kwanglo Lee; Wonsun Jeong; Sangil Chon; Kwon-Woo Ha; Myungkyu Han; Hyun-Soo Choi; Sungroh Yoon

doi:10.1088/1361-6579/acb30f

Improving generalization performance of electrocardiogram classification models

Physiol Meas. 2023 May 10;44(5). doi: 10.1088/1361-6579/acb30f.

Authors

Hyeongrok Han¹, Seongjae Park², Seonwoo Min¹, Eunji Kim¹, HyunGi Kim¹, Sangha Park¹, Jin-Kook Kim², Junsang Park², Junho An², Kwanglo Lee², Wonsun Jeong², Sangil Chon², Kwon-Woo Ha², Myungkyu Han², Hyun-Soo Choi^{3

4}, Sungroh Yoon^{1

5}

Affiliations

¹ Department of Electrical and Computer engineering, Seoul National University, Seoul, Republic of Korea.
² HUINNO Co., Ltd, Seoul, Republic of Korea.
³ Department of Computer Science and Engineering, Seoul National University of Science and Technology, Seoul, Republic of Korea.
⁴ ZIOVISION Inc., Chuncheon, Republic of Korea.
⁵ Interdisciplinary Program in Artificial Intelligence, Seoul National University, Seoul, Republic of Korea.

PMID: 36638544
DOI: 10.1088/1361-6579/acb30f

Abstract

Objective.Recently, many electrocardiogram (ECG) classification algorithms using deep learning have been proposed. Because the ECG characteristics vary across datasets owing to variations in factors such as recorded hospitals and the race of participants, the model needs to have a consistently high generalization performance across datasets. In this study, as part of the PhysioNet/Computing in Cardiology Challenge (PhysioNet Challenge) 2021, we present a model to classify cardiac abnormalities from the 12- and the reduced-lead ECGs.Approach.To improve the generalization performance of our earlier proposed model, we adopted a practical suite of techniques, i.e. constant-weighted cross-entropy loss, additional features, mixup augmentation, squeeze/excitation block, and OneCycle learning rate scheduler. We evaluated its generalization performance using the leave-one-dataset-out cross-validation setting. Furthermore, we demonstrate that the knowledge distillation from the 12-lead and large-teacher models improved the performance of the reduced-lead and small-student models.Main results.With the proposed model, our DSAIL SNU team has received Challenge scores of 0.55, 0.58, 0.58, 0.57, and 0.57 (ranked 2nd, 1st, 1st, 2nd, and 2nd of 39 teams) for the 12-, 6-, 4-, 3-, and 2-lead versions of the hidden test set, respectively.Significance.The proposed model achieved a higher generalization performance over six different hidden test datasets than the one we submitted to the PhysioNet Challenge 2020.

Keywords: ECG; artificial intelligence; biomedical engineering; cardiovascular disease; deep learning; knowledge distillation.

Creative Commons Attribution license.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Atrial Fibrillation*
Electrocardiography / methods
Entropy
Humans