Deep Learning-Based Advances in Protein Structure Prediction

Int J Mol Sci. 2021 May 24;22(11):5553. doi: 10.3390/ijms22115553.

Abstract

Obtaining an accurate description of protein structure is a fundamental step toward understanding the underpinnings of biology. Although recent advances in experimental approaches have greatly enhanced our ability to determine protein structures, the gap between the number of protein sequences and the number of known protein structures continues to widen. Computational protein structure prediction is one way to fill this gap. Recently, the protein structure prediction field has advanced substantially due to Deep Learning (DL)-based approaches, as evidenced by the success of AlphaFold2 in the most recent Critical Assessment of protein Structure Prediction (CASP14). In this article, we highlight important milestones and progress in the field of protein structure prediction due to DL-based methods, as observed in CASP experiments. We describe advances in the various steps of the protein structure prediction pipeline, viz. protein contact map prediction, protein distogram prediction, protein real-valued distance prediction, and quality assessment/refinement. We also highlight some end-to-end DL-based approaches for protein structure prediction. Additionally, as there have been some recent DL-based advances in protein structure determination using cryo-electron microscopy (cryo-EM), we also highlight some of the important progress in that field. Finally, we provide an outlook and possible future research directions for DL-based approaches in the protein structure prediction arena.

Keywords: deep learning; protein contact map prediction; protein distance prediction; protein quality assessment; protein structure prediction.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Computational Biology / methods*
  • Cryoelectron Microscopy / methods*
  • Databases, Protein
  • Deep Learning*
  • Models, Molecular
  • Neural Networks, Computer
  • Protein Conformation
  • Proteins / chemistry*
  • Sequence Analysis, Protein / methods*
  • Software

Substances

  • Proteins