Protein Function Analysis through Machine Learning

Biomolecules. 2022 Sep 6;12(9):1246. doi: 10.3390/biom12091246.

Abstract

Machine learning (ML) has been an important arsenal in computational biology used to elucidate protein function for decades. With the recent burgeoning of novel ML methods and applications, new ML approaches have been incorporated into many areas of computational biology dealing with protein function. We examine how ML has been integrated into a wide range of computational models to improve prediction accuracy and gain a better understanding of protein function. The applications discussed are protein structure prediction, protein engineering using sequence modifications to achieve stability and druggability characteristics, molecular docking in terms of protein-ligand binding, including allosteric effects, protein-protein interactions and protein-centric drug discovery. To quantify the mechanisms underlying protein function, a holistic approach that takes structure, flexibility, stability, and dynamics into account is required, as these aspects become inseparable through their interdependence. Another key component of protein function is conformational dynamics, which often manifest as protein kinetics. Computational methods that use ML to generate representative conformational ensembles and quantify differences in conformational ensembles important for function are included in this review. Future opportunities are highlighted for each of these topics.

Keywords: allostery; conformational sampling; force fields; machine learning; molecular docking; protein dynamics; protein function; protein structure prediction; protein–protein interactions.

Publication types

  • Review
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, N.I.H., Extramural

MeSH terms

  • Computational Biology* / methods
  • Ligands
  • Machine Learning
  • Molecular Docking Simulation
  • Molecular Dynamics Simulation
  • Protein Conformation
  • Proteins* / chemistry

Substances

  • Ligands
  • Proteins