scCASE: accurate and interpretable enhancement for single-cell chromatin accessibility sequencing data

Nat Commun. 2024 Feb 22;15(1):1629. doi: 10.1038/s41467-024-46045-w.

Abstract

Single-cell chromatin accessibility sequencing (scCAS) has emerged as a valuable tool for interrogating and elucidating epigenomic heterogeneity and gene regulation. However, scCAS data inherently suffer from limitations such as high sparsity and dimensionality, which pose significant challenges for downstream analyses. Although several methods have been proposed to enhance scCAS data, challenges and limitations still hinder their effectiveness. Here, we propose scCASE, an scCAS data enhancement method based on non-negative matrix factorization that incorporates an iteratively updated cell-to-cell similarity matrix. Through comprehensive experiments on multiple datasets, we demonstrate the advantages of scCASE over existing methods for scCAS data enhancement. The interpretable cell type-specific peaks identified by scCASE can provide valuable biological insights into cell subpopulations. Moreover, to leverage the large compendia of available omics data as a reference, we further extend scCASE to scCASER, which enables the incorporation of external reference data to improve enhancement performance.
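
The general idea of pairing non-negative matrix factorization with an iteratively updated cell-to-cell similarity matrix can be illustrated with a toy sketch. The code below is not the scCASE algorithm: the abstract does not give its objective function or update rules. Everything here is an assumption made for illustration, including the function name nmf_with_similarity, the cosine similarity, the smoothing weight lam, and the use of standard Lee-Seung multiplicative updates.

```python
import numpy as np

def nmf_with_similarity(X, k=5, n_iter=50, lam=0.5, seed=0, eps=1e-10):
    """Toy sketch (hypothetical, not scCASE itself).

    X is a non-negative peaks x cells matrix. Returns the low-rank
    reconstruction W @ H, the factors W and H, and the last
    cell-to-cell similarity matrix Z.
    """
    rng = np.random.default_rng(seed)
    n_peaks, n_cells = X.shape
    W = rng.random((n_peaks, k))   # peak loadings per component
    H = rng.random((k, n_cells))   # cell embeddings per component

    for _ in range(n_iter):
        # Standard multiplicative updates for min ||X - W H||_F^2
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)

        # Recompute the similarity from the current embeddings
        # (assumed here: cosine similarity, column-normalized).
        Hn = H / (np.linalg.norm(H, axis=0, keepdims=True) + eps)
        Z = Hn.T @ Hn
        Z /= Z.sum(axis=0, keepdims=True) + eps

        # Pull each cell's embedding toward a weighted average of
        # the embeddings of similar cells.
        H = (1 - lam) * H + lam * (H @ Z)

    return W @ H, W, H, Z

# Usage on a small synthetic binary matrix (30 peaks x 12 cells)
rng = np.random.default_rng(1)
X = (rng.random((30, 12)) < 0.3).astype(float)
X[0, :] = 1.0  # make sure no cell is entirely empty
enhanced, W, H, Z = nmf_with_similarity(X, k=3, n_iter=20)
print(enhanced.shape)
```

In this sketch the product W @ H plays the role of the enhanced (denser) matrix, and the columns of W could be inspected for peaks with high loadings per component; how scCASE itself derives cell type-specific peaks is described in the full paper, not in this abstract.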

MeSH terms

  • Algorithms*
  • Chromatin* / genetics
  • Epigenomics / methods
  • Gene Expression Regulation
  • Single-Cell Analysis

Substances

  • Chromatin