ClonoCluster: A method for using clonal origin to inform transcriptome clustering

Cell Genom. 2023 Jan 12;3(2):100247. doi: 10.1016/j.xgen.2022.100247. eCollection 2023 Feb 8.

Abstract

Clustering cells based on their high-dimensional profiles is an important data reduction process by which researchers infer distinct cellular states. The advent of cellular barcoding, however, provides an alternative means by which to group cells: by their clonal origin. We developed ClonoCluster, a computational method that combines both clone and transcriptome information to create hybrid clusters that weight both kinds of data with a tunable parameter. We generated hybrid clusters across six independent datasets and found that ClonoCluster generated qualitatively different clusters in all cases. The markers of these hybrid clusters were different but had equivalent fidelity to transcriptome-only clusters. The genes most strongly associated with the rearrangements in hybrid clusters were ribosomal function and extracellular matrix genes. We also developed the complementary tool Warp Factor that incorporates clone information in popular 2D visualization techniques like UMAP. Integrating ClonoCluster and Warp Factor revealed biologically relevant markers of cell identity.

Keywords: Bioinformatics; clonality; clustering; lineage tracing; single cell.