Low-data interpretable deep learning prediction of antibody viscosity using a biophysically meaningful representation

Sci Rep. 2023 Feb 20;13(1):2917. doi: 10.1038/s41598-023-28841-4.

Abstract

Deep learning, aided by the availability of large datasets, has led to substantial advances across many disciplines. However, many scientific problems of practical interest lack sufficiently large datasets amenable to deep learning. Prediction of antibody viscosity is one such problem, where deep learning methods have not yet been explored because relevant training data are relatively scarce. In this work, we overcome this limitation using a biophysically meaningful representation that enables us to develop generalizable models even with limited training data. We present PfAbNet-viscosity, a 3D convolutional neural network architecture, to predict the high-concentration viscosity of therapeutic antibodies. We show that, with the electrostatic potential surface of the antibody variable region as the only input to the network, models trained on as few as a couple dozen data points can generalize with high accuracy. Our feature attribution analysis shows that PfAbNet-viscosity has learned key biophysical drivers of viscosity. The applicability of our approach to other biological systems is discussed.
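As an illustration of the kind of input a 3D convolutional network consumes, the sketch below bins surface points carrying electrostatic potential values into a cubic voxel grid. It is not the PfAbNet-viscosity preprocessing, which the abstract does not describe. The function name `voxelize_potential`, the grid size, the spatial extent, and the choice to average potentials per voxel are all assumptions made for this example.

```python
import numpy as np

def voxelize_potential(points, potentials, grid_size=16, extent=10.0):
    """Average per-point potential values into a cubic voxel grid.

    Hypothetical helper for illustration only (not from the paper).
    points:     (N, 3) coordinates, assumed centered on the origin
    potentials: (N,) electrostatic potential at each surface point
    Returns a (grid_size, grid_size, grid_size) array; empty voxels are 0.
    """
    points = np.asarray(points, dtype=float)
    potentials = np.asarray(potentials, dtype=float)

    # Map coordinates in [-extent, extent) to integer voxel indices.
    idx = np.floor((points + extent) / (2 * extent) * grid_size).astype(int)

    # Discard points that fall outside the grid.
    inside = np.all((idx >= 0) & (idx < grid_size), axis=1)
    idx, potentials = idx[inside], potentials[inside]

    # Accumulate the sum and the count per voxel, then take the mean.
    total = np.zeros((grid_size,) * 3)
    count = np.zeros((grid_size,) * 3)
    cell = (idx[:, 0], idx[:, 1], idx[:, 2])
    np.add.at(total, cell, potentials)
    np.add.at(count, cell, 1.0)
    return np.divide(total, count, out=np.zeros_like(total), where=count > 0)
```

A grid produced this way could be passed as a single-channel volume to a 3D convolutional network. Averaging within each voxel is only one possible design choice; other schemes, such as distance-weighted smoothing, would serve equally well for a sketch.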

MeSH terms

  • Antibodies
  • Big Data
  • Deep Learning*
  • Immunoglobulin Variable Region
  • Viscosity

Substances

  • Antibodies
  • Immunoglobulin Variable Region