Function2Form Bridge-Toward synthetic protein holistic performance prediction

Proteins. 2020 Mar;88(3):462-475. doi: 10.1002/prot.25825. Epub 2019 Oct 29.

Abstract

Protein engineering and synthetic biology stand to benefit immensely from recent advances in silico tools for structural and functional analyses of proteins. In the context of designing novel proteins, current in silico tools inform the user on individual parameters of a query protein, with output scores/metrics unique to each parameter. In reality, proteins feature multiple "parts"/functions and modification of a protein aimed at altering a given part, typically has collateral impact on other protein parts. A system for prediction of the combined effect of design parameters on the overall performance of the final protein does not exist. Function2Form Bridge (F2F-Bridge) attempts to address this by combining the scores of different design parameters pertaining to the protein being analyzed into a single easily interpreted output describing overall performance. The strategy comprises of (a) a mathematical strategy combining data from a myriad of in silico tools into an OP-score (a singular score informing on a user-defined overall performance) and (b) the F2F Plot, a graphical means of informing the wetlab biologist holistically on designed construct suitability in the context of multiple parameters, highlighting scope for improvement. F2F predictive output was compared with wetlab data from a range of synthetic proteins designed, built, and tested for this study. Statistical/machine learning approaches for predicting overall performance, for use alongside the F2F plot, were also examined. Comparisons between wetlab performance and F2F predictions demonstrated close and reliable correlations. This user-friendly strategy represents a pivotal enabler in increasing the accessibility of synthetic protein building and de novo protein design.

Keywords: antibody screening; community-based data reporting; de novo protein design; high throughput in silico screening; in silico modeling; machine learning; protein scoring; synthetic biology.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Antibodies / chemistry*
  • Antibodies / metabolism
  • Coagulase / chemistry*
  • Coagulase / metabolism
  • Humans
  • Machine Learning*
  • Models, Statistical
  • Mucin-1 / chemistry*
  • Mucin-1 / metabolism
  • Protein Engineering / methods
  • Staphylococcus aureus / chemistry
  • Structure-Activity Relationship
  • Synthetic Biology / methods*

Substances

  • Antibodies
  • ClfA protein, Staphylococcus aureus
  • Coagulase
  • MUC1 protein, human
  • Mucin-1