A Bayesian method to infer copy number clones from single-cell RNA and ATAC sequencing

PLoS Comput Biol. 2023 Nov 2;19(11):e1011557. doi: 10.1371/journal.pcbi.1011557. eCollection 2023 Nov.

Abstract

Single-cell RNA and ATAC sequencing technologies enable the examination of gene expression and chromatin accessibility in individual cells, providing insights into cellular phenotypes. In cancer research, it is important to consistently analyze these states within an evolutionary context on genetic clones. Here we present CONGAS+, a Bayesian model to map single-cell RNA and ATAC profiles onto the latent space of copy number clones. CONGAS+ clusters cells into tumour subclones with similar ploidy, rendering straightforward to compare their expression and chromatin profiles. The framework, implemented on GPU and tested on real and simulated data, scales to analyse seamlessly thousands of cells, demonstrating better performance than single-molecule models, and supporting new multi-omics assays. In prostate cancer, lymphoma and basal cell carcinoma, CONGAS+ successfully identifies complex subclonal architectures while providing a coherent mapping between ATAC and RNA, facilitating the study of genotype-phenotype maps and their connection to genomic instability.

MeSH terms

  • Bayes Theorem
  • Chromatin
  • Clone Cells
  • DNA Copy Number Variations* / genetics
  • High-Throughput Nucleotide Sequencing / methods
  • RNA* / genetics

Substances

  • RNA
  • Chromatin

Grants and funding

This work was funded by the CRUK/AIRC Accelerator Award #22790 to (MA, AG and GC), “Single-cell Cancer Evolution in the Clinic”, by AIRC under MFAG 2020 - ID. 24913 project – P.I. Caravagna Giulio to (GC), by the European Commission Program PPPA2027, PPPA-2021-AIPC #LC-01815952/101052609 to (MA) and by the 2021 FAQC program of the Universitá degli Studi di Milano-Bicocca to (MA). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.