Novel machine learning approaches revolutionize protein knowledge

Trends Biochem Sci. 2023 Apr;48(4):345-359. doi: 10.1016/j.tibs.2022.11.001. Epub 2022 Dec 9.

Abstract

Breakthrough methods in machine learning (ML), protein structure prediction, and novel ultrafast structural aligners are revolutionizing structural biology. Obtaining accurate models of proteins and annotating their functions on a large scale is no longer limited by time and resources. The most recent method to be top ranked by the Critical Assessment of Structure Prediction (CASP) assessment, AlphaFold 2 (AF2), is capable of building structural models with an accuracy comparable to that of experimental structures. Annotations of 3D models are keeping pace with the deposition of the structures due to advancements in protein language models (pLMs) and structural aligners that help validate these transferred annotations. In this review we describe how recent developments in ML for protein science are making large-scale structural bioinformatics available to the general scientific community.

Keywords: AI; AlphaFold2; embeddings; machine learning; pLM; protein structure prediction; structure alignment.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods
  • Machine Learning*
  • Protein Conformation
  • Proteins* / chemistry

Substances

  • Proteins