Deep Learning Algorithms for Estimation of Demographic and Anthropometric Features from Electrocardiograms

J Clin Med. 2023 Apr 12;12(8):2828. doi: 10.3390/jcm12082828.

Abstract

The electrocardiogram (ECG) has been known to be affected by demographic and anthropometric factors. This study aimed to develop deep learning models to predict the subject's age, sex, ABO blood type, and body mass index (BMI) based on ECGs. This retrospective study included individuals aged 18 years or older who visited a tertiary referral center with ECGs acquired from October 2010 to February 2020. Using convolutional neural networks (CNNs) with three convolutional layers, five kernel sizes, and two pooling sizes, we developed both classification and regression models. We verified a classification model to be applicable for age (<40 years vs. ≥40 years), sex (male vs. female), BMI (<25 kg/m2 vs. ≥25 kg/m2), and ABO blood type. A regression model was also developed and validated for age and BMI estimation. A total of 124,415 ECGs (1 ECG per subject) were included. The dataset was constructed by dividing the entire set of ECGs at a ratio of 4:3:3. In the classification task, the area under the receiver operating characteristic (AUROC), which represents a quantitative indicator of the judgment threshold, was used as the primary outcome. The mean absolute error (MAE), which represents the difference between the observed and estimated values, was used in the regression task. For age estimation, the CNN achieved an AUROC of 0.923 with an accuracy of 82.97%, and a MAE of 8.410. For sex estimation, the AUROC was 0.947 with an accuracy of 86.82%. For BMI estimation, the AUROC was 0.765 with an accuracy of 69.89%, and a MAE of 2.332. For ABO blood type estimation, the CNN showed an inferior performance, with a top-1 accuracy of 31.98%. For the ABO blood type estimation, the CNN showed an inferior performance, with a top-1 accuracy of 31.98% (95% CI, 31.98-31.98%). Our model could be adapted to estimate individuals' demographic and anthropometric features from their ECGs; this would enable the development of physiologic biomarkers that can better reflect their health status than chronological age.

Keywords: BMI; age; artificial intelligence; blood type; deep learning; demographics; electrocardiography; sex.