Pruning and thresholding approach for methylation risk scores in multi-ancestry populations

Epigenetics. 2023 Dec;18(1):2187172. doi: 10.1080/15592294.2023.2187172.

Abstract

Recent efforts have focused on developing methylation risk scores (MRS), a weighted sum of the individual's DNA methylation (DNAm) values of pre-selected CpG sites. Most of the current MRS approaches that utilize Epigenome-wide association studies (EWAS) summary statistics only include genome-wide significant CpG sites and do not consider co-methylation. New methods that relax the p-value threshold to include more CpG sites and account for the inter-correlation of DNAm might improve the predictive performance of MRS. We paired informed co-methylation pruning with P-value thresholding to generate pruning and thresholding (P+T) MRS and evaluated its performance among multi-ancestry populations. Through simulation studies and real data analyses, we demonstrated that pruning provides an improvement over simple thresholding methods for prediction of phenotypes. We demonstrated that European-derived summary statistics can be used to develop P+T MRS among other populations such as African populations. However, the prediction accuracy of P+T MRS may differ across multi-ancestry population due to environmental/cultural/social differences.

Keywords: Admixed population; Clumping and thresholding; Epigenetic scores; Polygenic DNA methylation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • CpG Islands
  • DNA Methylation*
  • Epigenome*
  • Genome-Wide Association Study
  • Phenotype
  • Risk Factors