High-throughput immune repertoire analysis with IGoR

Nat Commun. 2018 Feb 8;9(1):561. doi: 10.1038/s41467-018-02832-w.

Abstract

High-throughput immune repertoire sequencing is promising to lead to new statistical diagnostic tools for medicine and biology. Successful implementations of these methods require a correct characterization, analysis, and interpretation of these data sets. We present IGoR (Inference and Generation Of Repertoires)-a comprehensive tool that takes B or T cell receptor sequence reads and quantitatively characterizes the statistics of receptor generation from both cDNA and gDNA. It probabilistically annotates sequences and its modular structure can be used to investigate models of increasing biological complexity for different organisms. For B cells, IGoR returns the hypermutation statistics, which we use to reveal co-localization of hypermutations along the sequence. We demonstrate that IGoR outperforms existing tools in accuracy and estimate the sample sizes needed for reliable repertoire characterization.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • B-Lymphocytes / cytology
  • B-Lymphocytes / immunology*
  • Base Sequence
  • Benchmarking
  • DNA, Complementary / genetics
  • DNA, Complementary / immunology
  • Gene Expression
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Immunity, Innate
  • Molecular Sequence Annotation
  • Receptors, Antigen, B-Cell / genetics*
  • Receptors, Antigen, B-Cell / immunology
  • Receptors, Antigen, T-Cell / genetics*
  • Receptors, Antigen, T-Cell / immunology
  • Software*
  • T-Lymphocytes / cytology
  • T-Lymphocytes / immunology*
  • V(D)J Recombination*

Substances

  • DNA, Complementary
  • Receptors, Antigen, B-Cell
  • Receptors, Antigen, T-Cell