Validity of polygenic risk scores: are we measuring what we think we are?

Hum Mol Genet. 2019 Nov 21;28(R2):R143-R150. doi: 10.1093/hmg/ddz205.

Abstract

Polygenic risk scores (PRSs) have become the standard for quantifying genetic liability in the prediction of disease risks. PRSs are generally constructed as weighted sum scores of risk alleles using effect sizes from genome-wide association studies as their weights. The construction of PRSs is being improved with more appropriate selection of independent single-nucleotide polymorphisms (SNPs) and optimized estimation of their weights but is rarely reflected upon from a theoretical perspective, focusing on the validity of the risk score. Borrowing from psychometrics, this paper discusses the validity of PRSs and introduces the three main types of validity that are considered in the evaluation of tests and measurements: construct, content, and criterion validity. This introduction is followed by a discussion of three topics that challenge the validity of PRS, namely, their claimed independence of clinical risk factors, the consequences of relaxing SNP inclusion thresholds and the selection of SNP weights. This discussion of the validity of PRS reminds us that we need to keep questioning if weighted sums of risk alleles are measuring what we think they are in the various scenarios in which PRSs are used and that we need to keep exploring alternative modeling strategies that might better reflect the underlying biological pathways.

Publication types

  • Review

MeSH terms

  • Alleles
  • Diabetes Mellitus, Type 2 / epidemiology
  • Diabetes Mellitus, Type 2 / genetics
  • Genetic Predisposition to Disease*
  • Genome-Wide Association Study*
  • Genotype
  • Humans
  • Models, Genetic
  • Multifactorial Inheritance*
  • Polymorphism, Single Nucleotide
  • Psychometrics / instrumentation
  • Psychometrics / methods
  • Reproducibility of Results
  • Risk Factors