An integrative analysis of TFBS-clustered regions reveals new transcriptional regulation models on the accessible chromatin landscape

Sci Rep. 2015 Feb 16:5:8465. doi: 10.1038/srep08465.

Abstract

DNase I hypersensitive sites (DHSs) define the accessible chromatin landscape and have revolutionised the discovery of distinct cis-regulatory elements in diverse organisms. Here, we report the first comprehensive map of human transcription factor binding site (TFBS)-clustered regions using Gaussian kernel density estimation based on genome-wide mapping of the TFBSs in 133 human cell and tissue types. Approximately 1.6 million distinct TFBS-clustered regions, collectively spanning 27.7% of the human genome, were discovered. The TFBS complexity assigned to each TFBS-clustered region was highly correlated with genomic location, cell selectivity, evolutionary conservation, sequence features, and functional roles. An integrative analysis of these regions using ENCODE data revealed transcription factor occupancy, transcriptional activity, histone modification, DNA methylation, and chromatin structures that varied based on TFBS complexity. Furthermore, we found that we could recreate lineage-branching relationships by simple clustering of the TFBS-clustered regions from terminally differentiated cells. Based on these findings, a model of transcriptional regulation determined by TFBS complexity is proposed.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Binding Sites
  • Chromatin / chemistry
  • Chromatin / metabolism*
  • Chromatin Immunoprecipitation
  • Cluster Analysis
  • Databases, Factual
  • Epigenesis, Genetic
  • Evolution, Molecular
  • Gene Expression Regulation
  • Genome, Human
  • Histones / metabolism
  • Humans
  • K562 Cells
  • Methylation
  • Models, Genetic
  • RNA / metabolism
  • RNA Polymerase II / metabolism
  • Transcription Factors / genetics
  • Transcription Factors / metabolism*
  • Transcription, Genetic

Substances

  • Chromatin
  • Histones
  • Transcription Factors
  • RNA
  • RNA Polymerase II