Design of Biopharmaceutical Formulations Accelerated by Machine Learning

Mol Pharm. 2021 Oct 4;18(10):3843-3853. doi: 10.1021/acs.molpharmaceut.1c00469. Epub 2021 Sep 14.

Abstract

In addition to activity, successful biological drugs must exhibit a series of suitable developability properties, which depend on both protein sequence and buffer composition. In the context of this high-dimensional optimization problem, advanced algorithms from the domain of machine learning are highly beneficial in complementing analytical screening and rational design. Here, we propose a Bayesian optimization algorithm to accelerate the design of biopharmaceutical formulations. We demonstrate the power of this approach by identifying the formulation that optimizes the thermal stability of three tandem single-chain Fv variants within 25 experiments, a number which is less than one-third of the experiments that would be required by a classical DoE method and several orders of magnitude smaller compared to detailed experimental analysis of full combinatorial space. We further show the advantage of this method over conventional approaches to efficiently transfer historical information as prior knowledge for the development of new biologics or when new buffer agents are available. Moreover, we highlight the benefit of our technique in engineering multiple biophysical properties by simultaneously optimizing both thermal and interface stabilities. This optimization minimizes the amount of surfactant in the formulation, which is important to decrease the risks associated with corresponding degradation processes. Overall, this method can provide high speed of converging to optimal conditions, the ability to transfer prior knowledge, and the identification of new nonlinear combinations of excipients. We envision that these features can lead to a considerable acceleration in formulation design and to parallelization of operations during drug development.

Keywords: Bayesian optimization; antibodies; artificial intelligence; biopharmaceuticals; developability; formulation; machine learning; stability.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Biological Products / administration & dosage*
  • Biological Products / therapeutic use
  • Drug Compounding / methods*
  • Drug Evaluation, Preclinical / methods
  • Humans
  • Machine Learning*
  • Nanoparticle Drug Delivery System / administration & dosage

Substances

  • Biological Products
  • Nanoparticle Drug Delivery System