pyTCR: A comprehensive and scalable solution for TCR-Seq data analysis to facilitate reproducibility and rigor of immunogenomics research

Front Immunol. 2022 Oct 27:13:954078. doi: 10.3389/fimmu.2022.954078. eCollection 2022.

Abstract

T cell receptor (TCR) studies have grown substantially with the advancement in the sequencing techniques of T cell receptor repertoire sequencing (TCR-Seq). The analysis of the TCR-Seq data requires computational skills to run the computational analysis of TCR repertoire tools. However biomedical researchers with limited computational backgrounds face numerous obstacles to properly and efficiently utilizing bioinformatics tools for analyzing TCR-Seq data. Here we report pyTCR, a computational notebook-based solution for comprehensive and scalable TCR-Seq data analysis. Computational notebooks, which combine code, calculations, and visualization, are able to provide users with a high level of flexibility and transparency for the analysis. Additionally, computational notebooks are demonstrated to be user-friendly and suitable for researchers with limited computational skills. Our tool has a rich set of functionalities including various TCR metrics, statistical analysis, and customizable visualizations. The application of pyTCR on large and diverse TCR-Seq datasets will enable the effective analysis of large-scale TCR-Seq data with flexibility, and eventually facilitate new discoveries.

Keywords: TCR - T cell receptor; TCR characterization; TCR-seq; computational notebooks; immunogenomics; reproducibility.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Benchmarking
  • Computational Biology
  • Data Analysis*
  • Receptors, Antigen, T-Cell* / genetics
  • Reproducibility of Results

Substances

  • Receptors, Antigen, T-Cell