Estimating the accuracy of pharmacophore-based detection of cognate receptor-ligand pairs in the immunoglobulin superfamily

Proteins. 2021 Jun;89(6):632-638. doi: 10.1002/prot.26046. Epub 2021 Jan 28.

Abstract

Secreted and membrane-bound members of the immunoglobulin superfamily (IgSF) encompass a large, diverse array of proteins that play central roles in immune response and neural development, and are implicated in diseases ranging from cancer to rheumatoid arthritis. Despite the potential biomedical benefits of understanding IgSF:IgSF cognate receptor-ligand interactions, relatively little about them is known at a molecular level, and experimentally probing all possible receptor-ligand pairs is prohibitively costly. The Protein Ligand Interface Design (ProtLID) algorithm is a computational pharmacophore-based approach to identify cognate receptor-ligand pairs that was recently validated in a pilot study on a small set of IgSF complexes. Although ProtLID has shown a success rate of 61% at identifying at least one cognate ligand for a given receptor, it currently lacks any form of confidence measure that can prioritize individual receptor-ligand predictions to pursue experimentally. In this study, we expanded the application of ProtLID to cover all IgSF complexes with available structural data. In addition, we introduced an approach to estimate the confidence of predictions made by ProtLID based on a statistical analysis of how the ProtLID-constructed pharmacophore matches the structures of candidate ligands. The confidence score combines the physicochemical compatibility, spatial consistency, and mathematical skewness of the distribution of matches throughout a set of candidate ligands. Our results suggest that a subset of cases meeting stringent confidence criteria will always have at least one successful receptor-ligand prediction.

Keywords: binding partner identification; immunoglobulin superfamily; pharmacophores.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Datasets as Topic
  • Humans
  • Immunoglobulins / chemistry*
  • Immunoglobulins / metabolism
  • Ligands
  • Membrane Proteins / chemistry*
  • Membrane Proteins / metabolism
  • Multigene Family*
  • Protein Binding
  • Protein Isoforms / chemistry
  • Protein Isoforms / metabolism
  • Research Design
  • Software*

Substances

  • IGSF1 protein, human
  • Immunoglobulins
  • Ligands
  • Membrane Proteins
  • Protein Isoforms