The missing link between genetic association and regulatory function

Elife. 2022 Dec 14:11:e74970. doi: 10.7554/eLife.74970.

Abstract

The genetic basis of most traits is highly polygenic and dominated by non-coding alleles. It is widely assumed that such alleles exert small regulatory effects on the expression of cis-linked genes. However, despite the availability of gene expression and epigenomic datasets, few variant-to-gene links have emerged. It is unclear whether these sparse results are due to limitations in available data and methods, or to deficiencies in the underlying assumed model. To better distinguish between these possibilities, we identified 220 gene-trait pairs in which protein-coding variants influence a complex trait or its Mendelian cognate. Despite the presence of expression quantitative trait loci near most GWAS associations, by applying a gene-based approach we found limited evidence that the baseline expression of trait-related genes explains GWAS associations, whether using colocalization methods (8% of genes implicated), transcription-wide association (2% of genes implicated), or a combination of regulatory annotations and distance (4% of genes implicated). These results contradict the hypothesis that most complex trait-associated variants coincide with homeostatic expression QTLs, suggesting that better models are needed. The field must confront this deficit and pursue this 'missing regulation.'

Keywords: colocalization; eQTL; gene regulation; genetics; genomics; human; non-coding variants.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Epigenomics
  • Genetic Predisposition to Disease
  • Genome-Wide Association Study* / methods
  • Humans
  • Multifactorial Inheritance / genetics
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci*

Associated data

  • Dryad/10.5061/dryad.612jm644q