Inter- and intraspecies comparison of phylogenetic fingerprints and sequence diversity of immunoglobulin variable genes

Immunogenetics. 2020 Jul;72(5):279-294. doi: 10.1007/s00251-020-01164-8. Epub 2020 May 4.

Abstract

Protection and neutralization of a vast array of pathogens is accomplished by the tremendous diversity of the B cell receptor (BCR) repertoire. For jawed vertebrates, this diversity is initiated via the somatic recombination of immunoglobulin (Ig) germline elements. While it is clear that the number of these germline segments differs from species to species, the extent of cross-species sequence diversity remains largely uncharacterized. Here we use extensive computational and statistical methods to investigate the sequence diversity and evolutionary relationship between Ig variable (V), diversity (D), and joining (J) germline segments across nine commonly studied species ranging from zebrafish to human. Metrics such as guanine-cytosine (GC) content showed low redundancy across Ig germline genes within a given species. Other comparisons, including amino acid motifs, evolutionary selection, and sequence diversity, revealed species-specific properties. Additionally, we showed that the germline-encoded diversity differs across antibody (recombined V-D-J) repertoires of various B cell subsets. To facilitate future comparative immunogenomics analysis, we created VDJgermlines, an R package that contains the germline sequences from multiple species. Our study informs strategies for the humanization and engineering of therapeutic antibodies.

Keywords: Antibody; Bioinformatics; Germline; Phylogenetics; R package; V-D-J recombination.

Publication types

  • Comparative Study

MeSH terms

  • Amino Acid Motifs
  • Animals
  • Antibody Diversity / genetics
  • B-Lymphocyte Subsets / metabolism
  • B-Lymphocytes / metabolism
  • Base Composition
  • Genetic Variation*
  • Humans
  • Immunoglobulin Heavy Chains / chemistry
  • Immunoglobulin Heavy Chains / genetics
  • Immunoglobulin Heavy Chains / metabolism
  • Immunoglobulin Variable Region / chemistry
  • Immunoglobulin Variable Region / genetics*
  • Phylogeny*
  • Selection, Genetic
  • Species Specificity
  • V(D)J Recombination / genetics
  • Vertebrates

Substances

  • Immunoglobulin Heavy Chains
  • Immunoglobulin Variable Region