Structure-guided reprogramming of serine recombinase DNA sequence specificity

Proc Natl Acad Sci U S A. 2011 Jan 11;108(2):498-503. doi: 10.1073/pnas.1014214108. Epub 2010 Dec 27.

Abstract

Routine manipulation of cellular genomes is contingent upon the development of proteins and enzymes with programmable DNA sequence specificity. Here we describe the structure-guided reprogramming of the DNA sequence specificity of the invertase Gin from bacteriophage Mu and Tn3 resolvase from Escherichia coli. Structure-guided and comparative sequence analyses were used to predict a network of amino acid residues that mediate resolvase and invertase DNA sequence specificity. Using saturation mutagenesis and iterative rounds of positive antibiotic selection, we identified extensively redesigned and highly convergent resolvase and invertase populations in the context of engineered zinc-finger recombinase (ZFR) fusion proteins. Reprogrammed variants selectively catalyzed recombination of nonnative DNA sequences >10,000-fold more effectively than their parental enzymes. Alanine-scanning mutagenesis revealed the molecular basis of resolvase and invertase DNA sequence specificity. When used as rationally designed ZFR heterodimers, the reprogrammed enzyme variants site-specifically modified unnatural and asymmetric DNA sequences. Early studies on the directed evolution of serine recombinase DNA sequence specificity produced enzymes with relaxed substrate specificity as a result of randomly incorporated mutations. In the current study, we focused our mutagenesis exclusively on DNA determinants, leading to redesigned enzymes that remained highly specific and directed transgene integration into the human genome with >80% accuracy. These results demonstrate that unique resolvase and invertase derivatives can be developed to site-specifically modify the human genome in the context of zinc-finger recombinase fusion proteins.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Bacteriophage mu / metabolism
  • DNA Nucleotidyltransferases / genetics*
  • Dimerization
  • Escherichia coli / enzymology
  • Gene Targeting
  • Genome, Human
  • Humans
  • Models, Molecular
  • Molecular Sequence Data
  • Mutagenesis
  • Protein Conformation
  • Protein Engineering / methods
  • Protein Structure, Secondary
  • Recombinases / genetics*
  • Sequence Analysis, DNA
  • Sequence Homology, Amino Acid
  • Serine / chemistry*
  • Transgenes
  • Transposon Resolvases / genetics*

Substances

  • Recombinases
  • Serine
  • DNA Nucleotidyltransferases
  • DNA invertase Gin
  • Tn3 resolvase
  • Transposon Resolvases