Improved modeling of side-chain--base interactions and plasticity in protein--DNA interface design

J Mol Biol. 2012 Jun 8;419(3-4):255-74. doi: 10.1016/j.jmb.2012.03.005. Epub 2012 Mar 15.

Abstract

Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed "motifs") was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein-DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Amino Acids / chemistry
  • Computer Simulation
  • DNA / chemistry*
  • DNA / metabolism
  • DNA-Binding Proteins / chemistry*
  • Models, Molecular
  • Nucleic Acid Conformation
  • Protein Binding
  • Protein Conformation
  • Protein Structure, Tertiary*
  • Proteins / chemistry
  • Proteins / metabolism

Substances

  • Amino Acids
  • DNA-Binding Proteins
  • Proteins
  • DNA