A data-driven feature extraction framework for predicting the severity of condition of congestive heart failure patients

Annu Int Conf IEEE Eng Med Biol Soc. 2015:2015:2534-7. doi: 10.1109/EMBC.2015.7318908.

Abstract

In this paper, we propose a novel methodology for utilizing disease diagnostic information to predict severity of condition for Congestive Heart Failure (CHF) patients. Our methodology relies on a novel, clustering-based, feature extraction framework using disease diagnostic information. To reduce the dimensionality we identify disease clusters using cooccurence frequencies. We then utilize these clusters as features to predict patient severity of condition. We build our clustering and feature extraction algorithm using the 2012 National Inpatient Sample (NIS), Healthcare Cost and Utilization Project (HCUP) which contains 7 million discharge records and ICD-9-CM codes. The proposed framework is tested on Ronald Reagan UCLA Medical Center Electronic Health Records (EHR) from 3041 patients. We compare our cluster-based feature set with another that incorporates the Charlson comorbidity score as a feature and demonstrate an accuracy improvement of up to 14% in the predictability of the severity of condition.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Electronic Health Records*
  • Heart Failure / diagnosis*
  • Heart Failure / epidemiology*
  • Humans
  • Inpatients
  • International Classification of Diseases
  • Models, Theoretical
  • Patient Discharge
  • Reproducibility of Results