A topology-preserving dimensionality reduction method for single-cell RNA-seq data using graph autoencoder

Sci Rep. 2021 Oct 8;11(1):20028. doi: 10.1038/s41598-021-99003-7.

Abstract

Dimensionality reduction is crucial for the visualization and interpretation of the high-dimensional single-cell RNA sequencing (scRNA-seq) data. However, preserving topological structure among cells to low dimensional space remains a challenge. Here, we present the single-cell graph autoencoder (scGAE), a dimensionality reduction method that preserves topological structure in scRNA-seq data. scGAE builds a cell graph and uses a multitask-oriented graph autoencoder to preserve topological structure information and feature information in scRNA-seq data simultaneously. We further extended scGAE for scRNA-seq data visualization, clustering, and trajectory inference. Analyses of simulated data showed that scGAE accurately reconstructs developmental trajectory and separates discrete cell clusters under different scenarios, outperforming recently developed deep learning methods. Furthermore, implementation of scGAE on empirical data showed scGAE provided novel insights into cell developmental lineages and preserved inter-cluster distances.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining / methods
  • Data Visualization*
  • Electronic Data Processing / methods
  • RNA-Seq / methods*
  • Sequence Analysis, RNA / methods
  • Single-Cell Analysis / methods*