Deep learning of protein sequence design of protein-protein interactions

Bioinformatics. 2023 Jan 1;39(1):btac733. doi: 10.1093/bioinformatics/btac733.

Abstract

Motivation: As more data of experimentally determined protein structures are becoming available, data-driven models to describe protein sequence-structure relationships become more feasible. Within this space, the amino acid sequence design of protein-protein interactions is still a rather challenging subproblem with very low success rates-yet, it is central to most biological processes.

Results: We developed an attention-based deep learning model inspired by algorithms used for image-caption assignments to design peptides or protein fragment sequences. Our trained model can be applied for the redesign of natural protein interfaces or the designed protein interaction fragments. Here, we validate the potential by recapitulating naturally occurring protein-protein interactions including antibody-antigen complexes. The designed interfaces accurately capture essential native interactions and have comparable native-like binding affinities in silico. Furthermore, our model does not need a precise backbone location, making it an attractive tool for working with de novo design of protein-protein interactions.

Availability and implementation: The source code of the method is available at https://github.com/strauchlab/iNNterfaceDesign.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Amino Acid Sequence
  • Deep Learning*
  • Peptides
  • Proteins / chemistry
  • Software

Substances

  • Proteins
  • Peptides