Sparsely-connected autoencoder (SCA) for single cell RNAseq data mining

NPJ Syst Biol Appl. 2021 Jan 5;7(1):1. doi: 10.1038/s41540-020-00162-6.

Abstract

Single-cell RNA sequencing (scRNAseq) is an essential tool to investigate cellular heterogeneity. Thus, it would be of great interest being able to disclose biological information belonging to cell subpopulations, which can be defined by clustering analysis of scRNAseq data. In this manuscript, we report a tool that we developed for the functional mining of single cell clusters based on Sparsely-Connected Autoencoder (SCA). This tool allows uncovering hidden features associated with scRNAseq data. We implemented two new metrics, QCC (Quality Control of Cluster) and QCM (Quality Control of Model), which allow quantifying the ability of SCA to reconstruct valuable cell clusters and to evaluate the quality of the neural network achievements, respectively. Our data indicate that SCA encoded space, derived by different experimentally validated data (TF targets, miRNA targets, Kinase targets, and cancer-related immune signatures), can be used to grasp single cell cluster-specific functional features. In our implementation, SCA efficacy comes from its ability to reconstruct only specific clusters, thus indicating only those clusters where the SCA encoding space is a key element for cells aggregation. SCA analysis is implemented as module in rCASC framework and it is supported by a GUI to simplify it usage for biologists and medical personnel.

MeSH terms

  • Algorithms
  • Base Sequence / genetics
  • Cluster Analysis
  • Data Mining / methods*
  • Exome Sequencing / methods
  • Humans
  • Neural Networks, Computer
  • Sequence Analysis, RNA / methods*
  • Single-Cell Analysis / methods*
  • Software
  • Systems Biology / methods