Machine Learning for Biologics: Opportunities for Protein Engineering, Developability, and Formulation

Trends Pharmacol Sci. 2021 Mar;42(3):151-165. doi: 10.1016/j.tips.2020.12.004. Epub 2021 Jan 23.

Abstract

Successful biologics must satisfy multiple properties, including activity and particular physicochemical features that are collectively defined as developability. These properties must be optimized simultaneously across a very broad design space of protein sequences and buffer compositions. In this context, artificial intelligence (AI), and especially machine learning (ML), have great potential to accelerate and improve the optimization of protein properties, increasing activity and safety while decreasing development time and manufacturing costs. We highlight emerging applications of ML in biologics discovery and development, focusing on protein engineering, early biophysical screening, and formulation. We discuss the power of ML to extract information from complex datasets and to reduce the experimental effort needed to achieve multiple quality targets simultaneously. Finally, we anticipate possible future interventions of AI in several steps of the biological landscape.

Keywords: antibodies; biologics development; developability; formulation; machine learning; protein engineering.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Artificial Intelligence*
  • Biological Products*
  • Humans
  • Machine Learning
  • Protein Engineering
  • Proteins

Substances

  • Biological Products
  • Proteins