Identification of B cell subsets based on antigen receptor sequences using deep learning

Front Immunol. 2024 Mar 21:15:1342285. doi: 10.3389/fimmu.2024.1342285. eCollection 2024.

Abstract

B cell receptors (BCRs) denote antigen specificity, while corresponding cell subsets indicate B cell functionality. Since each B cell uniquely encodes this combination, physical isolation and subsequent processing of individual B cells become indispensable to identify both attributes. However, this approach accompanies high costs and inevitable information loss, hindering high-throughput investigation of B cell populations. Here, we present BCR-SORT, a deep learning model that predicts cell subsets from their corresponding BCR sequences by leveraging B cell activation and maturation signatures encoded within BCR sequences. Subsequently, BCR-SORT is demonstrated to improve reconstruction of BCR phylogenetic trees, and reproduce results consistent with those verified using physical isolation-based methods or prior knowledge. Notably, when applied to BCR sequences from COVID-19 vaccine recipients, it revealed inter-individual heterogeneity of evolutionary trajectories towards Omicron-binding memory B cells. Overall, BCR-SORT offers great potential to improve our understanding of B cell responses.

Keywords: B cell phylogenetic inference; B cell receptor; B cell subset; antibody repertoire; deep learning; integrated gradients; next-generation sequencing; somatic hypermutation.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • B-Lymphocyte Subsets*
  • COVID-19 Vaccines
  • Deep Learning*
  • Humans
  • Phylogeny
  • Receptors, Antigen, B-Cell / genetics

Substances

  • COVID-19 Vaccines
  • Receptors, Antigen, B-Cell

Grants and funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Ministry of Science and ICT (MSIT, Republic of Korea), National Research Foundation of Korea (NRF-2020R1A3B3079653), and BK21 FOUR program of the Education and Research Program for Future ICT Pioneers (Seoul National University in 2023). This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT, No. RS-2023-00302766). This research was supported by a grant of the Korea Dementia Research Project through the Korea Dementia Research Center (KDRC), funded by the Ministry of Health & Welfare and Ministry of Science and ICT, Republic of Korea (HU20C0339). This research was supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (HI23C0521).