External validation of integrated genetic-epigenetic biomarkers for predicting incident coronary heart disease

Epigenomics. 2021 Jul;13(14):1095-1112. doi: 10.2217/epi-2021-0123. Epub 2021 Jun 21.

Abstract

Aim: The Framingham Risk Score (FRS) and atherosclerotic cardiovascular disease (ASCVD) Pooled Cohort Equation (PCE) for predicting risk for incident coronary heart disease (CHD) work poorly. To improve risk stratification for CHD, we developed a novel integrated genetic-epigenetic tool. Materials & methods: Using machine learning techniques and datasets from the Framingham Heart Study (FHS) and Intermountain Healthcare (IM), we developed and validated an integrated genetic-epigenetic model for predicting 3-year incident CHD. Results: Our approach was more sensitive than FRS and PCE and had high generalizability across cohorts. It performed with sensitivity/specificity of 79/75% in the FHS test set and 75/72% in the IM set. The sensitivity/specificity was 15/93% in FHS and 31/89% in IM for FRS, and sensitivity/specificity was 41/74% in FHS and 69/55% in IM for PCE. Conclusion: The use of our tool in a clinical setting could better identify patients at high risk for a heart attack.

Keywords: artificial intelligence; coronary heart disease; digital PCR; epigenetics; genetics; machine learning; prevention.

Plain language summary

Lay abstract Current lipid-based methods for assessing risk for coronary heart disease (CHD) have limitations. Conceivably, incorporating epigenetic information into risk prediction algorithms may be beneficial, but underlying genetic variation obscures its effects on risk. In order to develop a better CHD risk assessment method, we used artificial intelligence to identify genome-wide genetic and epigenetic biomarkers from two independent datasets of subjects characterized for incident CHD. The resulting algorithm significantly outperformed the current assessment methods in independent test sets. We conclude that artificial intelligence-moderated genetic-epigenetic algorithms have considerable potential as clinical tools for assessing risk for CHD.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Biomarkers*
  • Computational Biology / methods
  • Coronary Disease / diagnosis
  • Coronary Disease / etiology*
  • Coronary Disease / metabolism
  • Disease Susceptibility*
  • Epigenesis, Genetic
  • Epigenomics* / methods
  • Female
  • Gene Expression Regulation*
  • Genetic Markers
  • Genetic Predisposition to Disease
  • Genomics* / methods
  • Humans
  • Kaplan-Meier Estimate
  • Male
  • Middle Aged
  • Prognosis
  • Reproducibility of Results
  • Risk Assessment
  • Sensitivity and Specificity

Substances

  • Biomarkers
  • Genetic Markers