Highly Regional Genes: graph-based gene selection for single-cell RNA-seq data

J Genet Genomics. 2022 Sep;49(9):891-899. doi: 10.1016/j.jgg.2022.01.004. Epub 2022 Feb 8.

Abstract

Gene selection is an indispensable step for analyzing noisy and high-dimensional single-cell RNA-seq (scRNA-seq) data. Compared with the commonly used variance-based methods, by mimicking the human maker selection in the 2D visualization of cells, a new feature selection method called HRG (Highly Regional Genes) is proposed to find the informative genes, which show regional expression patterns in the cell-cell similarity network. We mathematically find the optimal expression patterns that can maximize the proposed scoring function. In comparison with several unsupervised methods, HRG shows high accuracy and robustness, and can increase the performance of downstream cell clustering and gene correlation analysis. Also, it is applicable for selecting informative genes of sequencing-based spatial transcriptomic data.

Keywords: Feature selection; Graphical models; Regional patterns; Single-cell RNA-sequencing; Spatially resolved transcriptomic data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Gene Expression Profiling / methods
  • Humans
  • RNA-Seq
  • Sequence Analysis, RNA / methods
  • Single-Cell Analysis* / methods
  • Transcriptome* / genetics