IPRO+/-: Computational Protein Design Tool Allowing for Insertions and Deletions

Structure. 2020 Dec 1;28(12):1344-1357.e4. doi: 10.1016/j.str.2020.08.003. Epub 2020 Aug 27.

Abstract

Insertions and deletions (indels) in protein sequences alter the residue spacing along the polypeptide backbone and consequently open up possibilities for tuning protein function in a way that is inaccessible by amino acid substitution alone. We describe an optimization-based computational protein redesign approach centered around predicting beneficial combinations of indels along with substitutions and also obtain putative substrate-docked structures for these protein variants. This modified algorithmic capability would be of interest for enzyme engineering and broadly inform other protein design tasks. We highlight this capability by (1) identifying active variants of a bacterial thioesterase enzyme ('TesA) with experimental corroboration, (2) recapitulating existing active TEM-1 β-Lactamase sequences of different sizes, and (3) identifying shorter 4-Coumarate:CoA ligases with enhanced in vitro activities toward non-native substrates. A separate PyRosetta-based open-source tool, Indel-Maker (http://www.maranasgroup.com/software.htm), has also been created to construct computational models of user-defined protein variants with specific indels and substitutions.

Keywords: Indel maker; biophysics; deletions; enzyme engineering; family sequence alignment; insertions; loop closure; mutational landscape; optimization; protein design.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Catalytic Domain
  • Coenzyme A Ligases / chemistry
  • Coenzyme A Ligases / metabolism
  • Escherichia coli Proteins / chemistry
  • Escherichia coli Proteins / metabolism
  • INDEL Mutation*
  • Lysophospholipase / chemistry
  • Lysophospholipase / metabolism
  • Molecular Docking Simulation / methods
  • Periplasmic Proteins / chemistry
  • Periplasmic Proteins / metabolism
  • Protein Binding
  • Protein Engineering / methods*
  • Sequence Analysis, Protein / methods*
  • Software*
  • beta-Lactamases / chemistry
  • beta-Lactamases / metabolism

Substances

  • Escherichia coli Proteins
  • Periplasmic Proteins
  • Lysophospholipase
  • tesA protein, E coli
  • beta-Lactamases
  • Coenzyme A Ligases
  • 4-coumarate-CoA ligase