Sparse Embedding for Interpretable Hospital Admission Prediction

Zepeng Huo; Harinath Sundararajhan; Nathan C Hurley; Adrian Haimovich; R Andrew Taylor; Bobak J Mortazavi

doi:10.1109/EMBC.2019.8856800

Sparse Embedding for Interpretable Hospital Admission Prediction

Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul:2019:3438-3441. doi: 10.1109/EMBC.2019.8856800.

Authors

Zepeng Huo, Harinath Sundararajhan, Nathan C Hurley, Adrian Haimovich, R Andrew Taylor, Bobak J Mortazavi

PMID: 31946618
DOI: 10.1109/EMBC.2019.8856800

Abstract

This paper introduces a sparse embedding for electronic health record (EHR) data in order to predict hospital admission. We use a k-sparse autoencoder to embed the original registry data into a much lower dimension, with sparsity as a goal. Then, t-SNE is used to show the embedding of each patient's data in a 2D plot. We then demonstrate the predictive accuracy in different existing machine learning algorithms. Our sparse embedding performs competitively against the original data and traditional embedding vectors with an AUROC of 0.878. In addition, we demonstrate the expressive power of our sparse embedding, i.e. interpretability. Sparse embedding can discover more phenotypes in t-SNE visualization than original data or traditional embedding. The discovered phenotypes can be regarded as different risk groups, through which we can study the driving risk factors for each patient phenotype.

MeSH terms

Algorithms*
Electronic Health Records*
Forecasting
Hospitalization / trends*
Humans
Machine Learning*
Risk Factors