Predicting 30-day hospital readmissions using artificial neural networks with medical code embedding

PLoS One. 2020 Apr 15;15(4):e0221606. doi: 10.1371/journal.pone.0221606. eCollection 2020.

Abstract

Reducing unplanned readmissions is a major focus of current hospital quality efforts. In order to avoid unfair penalization, administrators and policymakers use prediction models to adjust for the performance of hospitals from healthcare claims data. Regression-based models are a commonly utilized method for such risk-standardization across hospitals; however, these models often suffer in accuracy. In this study we, compare four prediction models for unplanned patient readmission for patients hospitalized with acute myocardial infarction (AMI), congestive health failure (HF), and pneumonia (PNA) within the Nationwide Readmissions Database in 2014. We evaluated hierarchical logistic regression and compared its performance with gradient boosting and two models that utilize artificial neural networks. We show that unsupervised Global Vector for Word Representations embedding representations of administrative claims data combined with artificial neural network classification models improves prediction of 30-day readmission. Our best models increased the AUC for prediction of 30-day readmissions from 0.68 to 0.72 for AMI, 0.60 to 0.64 for HF, and 0.63 to 0.68 for PNA compared to hierarchical logistic regression. Furthermore, risk-standardized hospital readmission rates calculated from our artificial neural network model that employed embeddings led to reclassification of approximately 10% of hospitals across categories of hospital performance. This finding suggests that prediction models that incorporate new methods classify hospitals differently than traditional regression-based approaches and that their role in assessing hospital performance warrants further investigation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Aged, 80 and over
  • Female
  • Humans
  • Logistic Models
  • Male
  • Middle Aged
  • Models, Biological
  • Myocardial Infarction / diagnosis*
  • Neural Networks, Computer
  • Patient Readmission*
  • Prognosis
  • Risk Assessment

Grants and funding

MIDAS Challenge award Michigan Institute for Data Science https://midas.umich.edu/ The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.