SPIN2: Predicting sequence profiles from protein structures using deep neural networks

Proteins. 2018 Jun;86(6):629-633. doi: 10.1002/prot.25489. Epub 2018 Mar 25.

Abstract

Designing protein sequences that can fold into a given structure is a well-known inverse protein-folding problem. One important characteristic to attain for a protein design program is the ability to recover wild-type sequences given their native backbone structures. The highest average sequence identity accuracy achieved by current protein-design programs in this problem is around 30%, achieved by our previous system, SPIN. SPIN is a program that predicts sequences compatible with a provided structure using a neural network with fragment-based local and energy-based nonlocal profiles. Our new model, SPIN2, uses a deep neural network and additional structural features to improve on SPIN. SPIN2 achieves over 34% in sequence recovery in 10-fold cross-validation and independent tests, a 4% improvement over the previous version. The sequence profiles generated from SPIN2 are expected to be useful for improving existing fold recognition and protein design techniques. SPIN2 is available at http://sparks-lab.org.

Keywords: bioinformatics; deep learning; fold recognition; neural networks; structure prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Databases, Protein
  • Models, Molecular
  • Neural Networks, Computer*
  • Protein Folding
  • Protein Structure, Secondary
  • Proteins / chemistry*
  • Software*

Substances

  • Proteins