AbFlex: designing antibody complementarity determining regions with flexible CDR definition

Bioinformatics. 2024 Mar 4;40(3):btae122. doi: 10.1093/bioinformatics/btae122.

Abstract

Motivation: Antibodies are proteins that the immune system produces in response to foreign pathogens. Designing antibodies that specifically bind to antigens is a key step in developing antibody therapeutics. The complementarity determining regions (CDRs) of the antibody are mainly responsible for binding to the target antigen, and therefore must be designed to recognize the antigen.

Results: We develop an antibody design model, AbFlex, that exhibits state-of-the-art performance in terms of structure prediction accuracy and amino acid recovery rate. Furthermore, >38% of newly designed antibody models are estimated to have better binding energies for their antigens than the wild types. The effectiveness of the model is attributed to two strategies developed to overcome the scarcity of antibody-antigen complex structure data. The first is to use an equivariant graph neural network model, which is more data-efficient. The second, and more important, is a new data augmentation strategy based on a flexible definition of CDRs, which significantly increases the performance of the CDR prediction model.
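The abstract does not say how the flexible CDR definition is implemented, so the following Python sketch is only one possible reading, not the AbFlex method. It assumes that augmentation means shifting each boundary of an annotated CDR span by a few residues, so that a single complex yields several training targets. It also shows the usual way an amino acid recovery rate is computed, as the fraction of designed positions that match the native residue. The function names, the `max_shift` parameter, and the example indices are hypothetical.

```python
import random
from typing import List, Tuple


def jitter_cdr_span(start: int, end: int, seq_len: int,
                    max_shift: int, rng: random.Random) -> Tuple[int, int]:
    """Shift both boundaries of a half-open CDR span [start, end).

    Assumption: each boundary moves independently by up to `max_shift`
    residues. The result is clamped so it stays inside the chain and
    always contains at least one residue.
    """
    new_start = start + rng.randint(-max_shift, max_shift)
    new_end = end + rng.randint(-max_shift, max_shift)
    new_start = max(0, min(new_start, seq_len - 1))
    new_end = max(new_start + 1, min(new_end, seq_len))
    return new_start, new_end


def augment_cdr_spans(start: int, end: int, seq_len: int,
                      n_draws: int = 20, max_shift: int = 2,
                      seed: int = 0) -> List[Tuple[int, int]]:
    """Return the distinct spans seen in `n_draws` random jitters.

    The original annotation is always included in the result.
    """
    rng = random.Random(seed)
    spans = {(start, end)}
    for _ in range(n_draws):
        spans.add(jitter_cdr_span(start, end, seq_len, max_shift, rng))
    return sorted(spans)


def amino_acid_recovery(native: str, designed: str) -> float:
    """Fraction of positions where the designed residue equals the native one."""
    if len(native) != len(designed):
        raise ValueError("sequences must have the same length")
    if not native:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(native, designed))
    return matches / len(native)


if __name__ == "__main__":
    # Hypothetical CDR annotated at residues 10..19 of a 120-residue chain.
    for span in augment_cdr_spans(10, 20, seq_len=120):
        print(span)
    # Four of five residues match the native loop.
    print(amino_acid_recovery("ARDYW", "ARDFW"))
```

Under this reading, every jittered span defines a slightly different region to mask and regenerate, which multiplies the number of training examples drawn from the same scarce set of antibody-antigen complexes.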

Availability and implementation: The source code and implementation are available at https://github.com/wsjeon92/AbFlex.

MeSH terms

  • Amino Acid Sequence
  • Antigen-Antibody Complex* / chemistry
  • Antigens
  • Complementarity Determining Regions* / chemistry
  • Complementarity Determining Regions* / metabolism
  • Models, Molecular

Substances

  • Complementarity Determining Regions
  • Antigen-Antibody Complex
  • Antigens