Before and after AlphaFold2: An overview of protein structure prediction

Front Bioinform. 2023 Feb 28:3:1120370. doi: 10.3389/fbinf.2023.1120370. eCollection 2023.

Abstract

Three-dimensional protein structure is directly correlated with its function and its determination is critical to understanding biological processes and addressing human health and life science problems in general. Although new protein structures are experimentally obtained over time, there is still a large difference between the number of protein sequences placed in Uniprot and those with resolved tertiary structure. In this context, studies have emerged to predict protein structures by methods based on a template or free modeling. In the last years, different methods have been combined to overcome their individual limitations, until the emergence of AlphaFold2, which demonstrated that predicting protein structure with high accuracy at unprecedented scale is possible. Despite its current impact in the field, AlphaFold2 has limitations. Recently, new methods based on protein language models have promised to revolutionize the protein structural biology allowing the discovery of protein structure and function only from evolutionary patterns present on protein sequence. Even though these methods do not reach AlphaFold2 accuracy, they already covered some of its limitations, being able to predict with high accuracy more than 200 million proteins from metagenomic databases. In this mini-review, we provide an overview of the breakthroughs in protein structure prediction before and after AlphaFold2 emergence.

Keywords: AlphaFold; free modeling; protein language model; protein structure prediction; template-based modeling.

Publication types

  • Review

Grants and funding

This work was supported by Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) [INCT—2014/50889-7 to JK], the Conselho Nacional de Desenvolvimento Científico e Tecnológico—CNPq [INCT—465586/2014-7 to JK], the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior—CAPES [PROEX—169192/2018-0 to LB] and the Zerbini Foundation and Foxconn Brazil as part of a research grant “Machine Learning in Cardiovascular Medicine”.