Spatial domain detection using contrastive self-supervised learning for spatial multi-omics technologies

bioRxiv [Preprint]. 2024 Feb 4:2024.02.02.578662. doi: 10.1101/2024.02.02.578662.

Abstract

Recent advances in spatially-resolved single-omics and multi-omics technologies have led to the emergence of computational tools to detect or predict spatial domains. Additionally, histological images and immunofluorescence (IF) staining of proteins and cell types provide multiple perspectives and a more complete understanding of tissue architecture. Here, we introduce Proust, a scalable tool to predict discrete domains using spatial multi-omics data by combining the low-dimensional representation of biological profiles based on graph-based contrastive self-supervised learning. Our scalable method integrates multiple data modalities, such as RNA, protein, and H&E images, and predicts spatial domains within tissue samples. Through the integration of multiple modalities, Proust consistently demonstrates enhanced accuracy in detecting spatial domains, as evidenced across various benchmark datasets and technological platforms.

Keywords: Spatially-resolved transcriptomics; contrastive learning; graph-based autoencoder; immunofluorescence images; multi-modal learning; spatial clustering.

Publication types

  • Preprint