Development and validation of a heart failure with preserved ejection fraction cohort using electronic medical records

BMC Cardiovasc Disord. 2018 Jun 28;18(1):128. doi: 10.1186/s12872-018-0866-5.

Abstract

Background: Heart failure (HF) with preserved ejection fraction (HFpEF) accounts for nearly half of prevalent HF, yet an HFpEF cohort is challenging to curate from a large database of electronic medical records (EMR) because it requires both an accurate HF diagnosis and left ventricular ejection fraction (EF) values that are consistently ≥50%.

Methods: We used the national Veterans Affairs EMR to curate a cohort of HFpEF patients from 2002 to 2014. EF values were extracted from clinical documents using natural language processing, and the algorithm for verifying clinical HFpEF was refined iteratively. The final algorithm used the following inclusion criteria: any International Classification of Diseases, Ninth Revision (ICD-9) code for HF (428.xx); all recorded EF values ≥50%; and either B-type natriuretic peptide (BNP) or amino-terminal pro-BNP (NT-proBNP) values recorded or diuretic use within one month of the HF diagnosis. The algorithm was validated by three independent reviewers through manual chart review of 100 HFpEF cases and 100 controls.
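
The three inclusion criteria listed in the Methods can be read as a simple per-patient filter. The following is a minimal sketch in Python, not the study's implementation. The record layout (`PatientRecord` and its fields), the function names, the 30-day approximation of "one month", the symmetric window around the diagnosis date, and the reading that the time window applies only to diuretic use are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from datetime import date, timedelta
from typing import List, Optional

# Assumption: "within one month" is approximated as 30 days before or after
# the HF diagnosis date. The abstract does not define the window precisely.
DIURETIC_WINDOW = timedelta(days=30)


@dataclass
class PatientRecord:
    """Hypothetical, simplified view of one patient's EMR data."""
    icd9_codes: List[str]                       # e.g. ["428.0", "401.9"]
    ef_values: List[float]                      # all recorded EF values, in percent
    hf_diagnosis_date: Optional[date] = None
    natriuretic_peptide_recorded: bool = False  # any BNP or NT-proBNP value on file
    diuretic_dates: List[date] = field(default_factory=list)


def has_hf_code(codes: List[str]) -> bool:
    """Criterion 1: any ICD-9 code in the 428.xx family."""
    return any(code == "428" or code.startswith("428.") for code in codes)


def all_ef_preserved(ef_values: List[float], threshold: float = 50.0) -> bool:
    """Criterion 2: at least one EF recorded and every recorded EF >= threshold."""
    return bool(ef_values) and all(ef >= threshold for ef in ef_values)


def diuretic_near_diagnosis(record: PatientRecord) -> bool:
    """Diuretic use within the assumed window around the HF diagnosis date."""
    if record.hf_diagnosis_date is None:
        return False
    return any(
        abs(d - record.hf_diagnosis_date) <= DIURETIC_WINDOW
        for d in record.diuretic_dates
    )


def meets_hfpef_criteria(record: PatientRecord) -> bool:
    """Apply the three inclusion criteria in sequence."""
    if not has_hf_code(record.icd9_codes):
        return False
    if not all_ef_preserved(record.ef_values):
        return False
    # Criterion 3: natriuretic peptide recorded OR diuretic use near diagnosis.
    return record.natriuretic_peptide_recorded or diuretic_near_diagnosis(record)
```

Requiring at least one recorded EF is a deliberate choice in this sketch: without it, a patient with no EF measurements would trivially satisfy "all recorded EF ≥50%".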

Results: We established an HFpEF cohort of 80,248 patients (out of a total of 1,155,376 patients with an ICD-9 diagnosis of HF). Mean age was 72 years; 96% were male and 12% were African American. In the validation analysis, the HFpEF algorithm identified HFpEF cases with a sensitivity of 88%, a specificity of 96%, a positive predictive value of 96%, and a negative predictive value of 87%.
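
For readers less familiar with the four validation metrics, the sketch below shows how each is computed from a 2×2 table of algorithm result versus chart review. The abstract does not report the underlying counts. The counts used here are hypothetical, chosen under the assumption that the 100 cases and 100 controls are the algorithm's positive and negative calls, so that the output matches the reported percentages after rounding. The names `ConfusionCounts` and `validation_metrics` are illustrative only.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class ConfusionCounts:
    tp: int  # algorithm positive, chart review positive
    fp: int  # algorithm positive, chart review negative
    fn: int  # algorithm negative, chart review positive
    tn: int  # algorithm negative, chart review negative


def validation_metrics(c: ConfusionCounts) -> Dict[str, float]:
    """Standard diagnostic accuracy measures from a 2x2 table."""
    return {
        "sensitivity": c.tp / (c.tp + c.fn),
        "specificity": c.tn / (c.tn + c.fp),
        "ppv": c.tp / (c.tp + c.fp),
        "npv": c.tn / (c.tn + c.fn),
    }


# Hypothetical counts (NOT taken from the paper): 100 algorithm-positive and
# 100 algorithm-negative patients, split to be consistent with the reported
# percentages after rounding.
example = ConfusionCounts(tp=96, fp=4, fn=13, tn=87)

for name, value in validation_metrics(example).items():
    print(f"{name}: {value:.0%}")
```

Note that the positive and negative predictive values depend on the proportion of true cases in the validation sample, so they would differ in a sample with a different case mix.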

Conclusion: We developed a sensitive, highly specific algorithm for detecting HFpEF in a large national database. This approach may be applicable to other large EMR databases to identify HFpEF patients.

Keywords: Electronic medical records; Epidemiology; Heart failure; Preserved ejection fraction; Validation.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Aged
  • Aged, 80 and over
  • Biomarkers / blood
  • Data Mining / methods*
  • Databases, Factual
  • Diuretics / therapeutic use
  • Echocardiography
  • Electronic Health Records*
  • Female
  • Heart Failure / classification
  • Heart Failure / diagnosis*
  • Heart Failure / epidemiology
  • Heart Failure / physiopathology
  • Humans
  • International Classification of Diseases
  • Male
  • Middle Aged
  • Natriuretic Peptide, Brain / blood
  • Natural Language Processing
  • Peptide Fragments / blood
  • Reproducibility of Results
  • Stroke Volume*
  • United States / epidemiology
  • United States Department of Veterans Affairs
  • Ventricular Function, Left*

Substances

  • Biomarkers
  • Diuretics
  • Peptide Fragments
  • pro-brain natriuretic peptide (1-76)
  • Natriuretic Peptide, Brain