Landscape of allele-specific transcription factor binding in the human genome

Nat Commun. 2021 May 12;12(1):2751. doi: 10.1038/s41467-021-23007-0.

Abstract

Sequence variants in gene regulatory regions alter gene expression and contribute to phenotypes of individual cells and the whole organism, including disease susceptibility and progression. Single-nucleotide variants in enhancers or promoters may affect gene transcription by altering transcription factor binding sites. Differential transcription factor binding at heterozygous genomic loci provides a natural source of information on such regulatory variants. We present a novel approach to call allele-specific transcription factor binding events at single-nucleotide variants in ChIP-Seq data, taking into account the joint contribution of aneuploidy and local copy number variation, which is estimated directly from variant calls. We have conducted a meta-analysis of more than 7 thousand ChIP-Seq experiments and assembled a database of allele-specific binding events listing more than half a million entries at nearly 270 thousand single-nucleotide polymorphisms for several hundred human transcription factors and cell types. These polymorphisms are enriched for associations with phenotypes of medical relevance and often overlap eQTLs, making them candidates for causality by linking variants with molecular mechanisms. In particular, there is a special class of switching sites, where different transcription factors preferentially bind alternative alleles, thus revealing allele-specific rewiring of molecular circuitry.
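
The abstract does not spell out the statistical model, so the following is only a minimal sketch of the general idea of testing allelic imbalance against a copy-number-aware baseline, not the authors' method. It uses a plain exact binomial test; the function names (`expected_ref_fraction`, `allelic_imbalance_pvalue`) and the read counts are hypothetical and chosen for illustration.

```python
from math import comb


def expected_ref_fraction(ref_copies: int, alt_copies: int) -> float:
    """Null-hypothesis fraction of reads from the reference allele.

    Assumption for this sketch: with no allele-specific binding, reads are
    drawn in proportion to the number of genomic copies of each allele.
    """
    return ref_copies / (ref_copies + alt_copies)


def binom_pmf(k: int, n: int, p: float) -> float:
    """Probability of exactly k successes in n Bernoulli(p) trials."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def allelic_imbalance_pvalue(ref_reads: int, alt_reads: int, p0: float = 0.5) -> float:
    """Two-sided exact binomial p-value for the observed allelic read counts.

    Sums the probabilities of all outcomes that are no more likely than the
    observed one under the null reference fraction p0.
    """
    n = ref_reads + alt_reads
    if n == 0:
        return 1.0  # no reads, no evidence of imbalance
    observed = binom_pmf(ref_reads, n, p0)
    # Small relative tolerance so that ties are not lost to rounding.
    threshold = observed * (1 + 1e-9)
    total = sum(
        pmf for pmf in (binom_pmf(k, n, p0) for k in range(n + 1)) if pmf <= threshold
    )
    return min(1.0, total)


# Hypothetical SNV covered by 20 reference and 10 alternative reads.
p_diploid = allelic_imbalance_pvalue(20, 10, p0=0.5)
# Same counts in a region assumed to carry two reference copies and one alternative copy.
p_adjusted = allelic_imbalance_pvalue(20, 10, p0=expected_ref_fraction(2, 1))
print(f"diploid baseline: {p_diploid:.3f}, copy-number-aware baseline: {p_adjusted:.3f}")
```

In this toy case the same 2:1 read ratio looks moderately imbalanced against a diploid baseline but is exactly what a 2:1 copy ratio predicts, which illustrates why aneuploidy and local copy number variation need to be accounted for before calling a site allele-specific.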

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles*
  • Chromatin / metabolism
  • Databases, Genetic
  • Gene Dosage
  • Gene Expression Regulation / genetics
  • Genome, Human*
  • Genome-Wide Association Study
  • Humans
  • Nucleotide Motifs
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Protein Binding
  • Quantitative Trait Loci
  • Regulatory Sequences, Nucleic Acid / genetics*
  • Transcription Factors / metabolism*

Substances

  • Chromatin
  • Transcription Factors