An experimental verification of the predicted effects of promoter TATA-box polymorphisms associated with human diseases on interactions between the TATA boxes and TATA-binding protein

PLoS One. 2013;8(2):e54626. doi: 10.1371/journal.pone.0054626. Epub 2013 Feb 12.

Abstract

Human genome sequencing has resulted in a great body of data, including a stunningly large number of single nucleotide polymorphisms (SNPs) with unknown phenotypic manifestations. Identification and comprehensive analysis of regulatory SNPs in human gene promoters will help quantify the effects of these SNPs on human health. Based on our experimental and computer-aided study of SNPs in TATA boxes and the use of literature data, we have derived an equation for TBP/TATA equilibrium binding in three successive steps: TATA-binding protein (TBP) sliding along DNA due to their nonspecific affinity for each other ↔ recognition of the TATA box ↔ stabilization of the TBP/TATA complex. Using this equation, we have analyzed TATA boxes containing SNPs associated with human diseases and made in silico predictions of changes in TBP/TATA affinity. An electrophoretic mobility shift assay (EMSA)-based experimental study performed under the most standardized conditions demonstrates that the experimentally measured values are highly correlated with the predicted values: the coefficient of linear correlation, r, was 0.822 at a significance level of α<10⁻⁷ for equilibrium K(D) values, (-ln K(D)), and 0.785 at a significance level of α<10⁻³ for changes in equilibrium K(D) (δ) due to SNPs in the TATA boxes (δ= -ln[K(D,TATAMut)]-(-ln[K(D,TATAMut)])). It has been demonstrated that the SNPs associated with increased risk of human diseases such as α-, β- and δ-thalassemia, myocardial infarction and thrombophlebitis, changes in immune response, amyotrophic lateral sclerosis, lung cancer and hemophilia B Leyden cause 2-4-fold changes in TBP/TATA affinity in most cases. The results obtained strongly suggest that the TBP/TATA equilibrium binding equation derived can be used for analysis of TATA-box sequences and identification of SNPs with a potential of being functionally important.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aryl Hydrocarbon Hydroxylases / genetics
  • Computational Biology*
  • Cytochrome P-450 CYP2A6
  • Disease / genetics*
  • Globins / genetics
  • Humans
  • Interleukin-1beta / genetics
  • Mannose-Binding Lectin / genetics
  • Models, Biological
  • Nitric Oxide Synthase Type II / genetics
  • Polymorphism, Single Nucleotide*
  • Promoter Regions, Genetic / genetics*
  • Protein Binding
  • Superoxide Dismutase / genetics
  • Superoxide Dismutase-1
  • TATA Box / genetics*
  • TATA-Box Binding Protein / metabolism*

Substances

  • Interleukin-1beta
  • MBL2 protein, human
  • Mannose-Binding Lectin
  • SOD1 protein, human
  • TATA-Box Binding Protein
  • Globins
  • NOS2 protein, human
  • Nitric Oxide Synthase Type II
  • Aryl Hydrocarbon Hydroxylases
  • Cytochrome P-450 CYP2A6
  • Superoxide Dismutase
  • Superoxide Dismutase-1

Grants and funding

The authors are grateful to Prof. Tatyana Merkulova for her valuable advice and Vladimir Koval for the MALDI-TOF/MS analysis of oligodeoxynucleotides. This work was supported by grants 10-04-00462-a and Research schools - 5278.2012.4 from the Russian Foundation for Basic Research, grant 136 from the Siberian Branch of the Russian Academy of Sciences, “Biological Diversity” from the Russian Academy of Sciences, grant B.30, #30.29, “Molecular and Cell Biology” RAS integration 6.8 “System biology: the computer-experimental approaches” and #28 “Life Origin and Biosphere becoming problems” programs (Russian Academy of Sciences). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.