Statistical analysis of structural determinants for protein-DNA-binding specificity

Proteins. 2016 Aug;84(8):1147-61. doi: 10.1002/prot.25061. Epub 2016 Jun 15.

Abstract

DNA-binding proteins play critical roles in biological processes including gene expression, DNA packaging and DNA repair. They bind to DNA target sequences with different degrees of binding specificity, ranging from highly specific (HS) to nonspecific (NS). Alterations of DNA-binding specificity, due to either genetic variation or somatic mutations, can lead to various diseases. In this study, a comparative analysis of protein-DNA complex structures was carried out to investigate the structural features that contribute to binding specificity. Protein-DNA complexes were grouped into three general classes based on degrees of binding specificity: HS, multispecific (MS), and NS. Our results show a clear trend of structural features among the three classes, including amino acid binding propensities, simple and complex hydrogen bonds, major/minor groove and base contacts, and DNA shape. We found that aspartate is enriched in HS DNA binding proteins and predominately binds to a cytosine through a single hydrogen bond or two consecutive cytosines through bidentate hydrogen bonds. Aromatic residues, histidine and tyrosine, are highly enriched in the HS and MS groups and may contribute to specific binding through different mechanisms. To further investigate the role of protein flexibility in specific protein-DNA recognition, we analyzed the conformational changes between the bound and unbound states of DNA-binding proteins and structural variations. The results indicate that HS and MS DNA-binding domains have larger conformational changes upon DNA-binding and larger degree of flexibility in both bound and unbound states. Proteins 2016; 84:1147-1161. © 2016 Wiley Periodicals, Inc.

Keywords: DNA shape; aromatic residue; aspartate; conformational changes; flexibility; hydrogen bond; structural variations; transcription factor; π-interaction.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Amino Acids / chemistry*
  • Binding Sites
  • DNA / chemistry*
  • DNA-Binding Proteins / chemistry*
  • Hydrogen Bonding
  • Hydrophobic and Hydrophilic Interactions
  • Models, Molecular
  • Nucleic Acid Conformation
  • Protein Binding
  • Protein Interaction Domains and Motifs
  • Protein Structure, Secondary
  • Static Electricity
  • Thermodynamics

Substances

  • Amino Acids
  • DNA-Binding Proteins
  • DNA