Accounting for cell type hierarchy in evaluating single cell RNA-seq clustering

Genome Biol. 2020 May 25;21(1):123. doi: 10.1186/s13059-020-02027-x.

Abstract

Cell clustering is one of the most common routines in single cell RNA-seq data analysis, and a number of specialized methods are available for it. Existing evaluations of these methods ignore an important biological characteristic: the structure of a population of cells is hierarchical. Ignoring this hierarchy can lead to misleading evaluation results. In this work, we develop two new metrics that take into account the hierarchical structure of cell types. We illustrate the application of the new metrics in constructed examples as well as in several real single cell datasets, and show that they provide more biologically plausible results.
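
The kind of constructed example the abstract refers to can be illustrated with a toy sketch. The abstract does not define the two proposed metrics, so the code below is not the authors' method: it contrasts the standard (flat) Rand index with a hypothetical hierarchy-weighted pairwise score written only for illustration. The cell type names, the dissimilarity values in `TYPE_DISSIMILARITY`, and the function `hierarchy_weighted_score` are all assumptions.

```python
from itertools import combinations

# ASSUMPTION: hypothetical dissimilarities between true cell types, in (0, 1].
# CD4 and CD8 T cells share a parent in the hierarchy, so they are "closer"
# to each other than either is to B cells.
TYPE_DISSIMILARITY = {
    frozenset(("CD4_T", "CD8_T")): 0.5,
    frozenset(("CD4_T", "B")): 1.0,
    frozenset(("CD8_T", "B")): 1.0,
}


def rand_index(truth, pred):
    """Standard Rand index: fraction of cell pairs on which the two
    partitions agree (both together or both apart)."""
    pairs = list(combinations(range(len(truth)), 2))
    agree = sum((truth[i] == truth[j]) == (pred[i] == pred[j]) for i, j in pairs)
    return agree / len(pairs)


def hierarchy_weighted_score(truth, pred, dissimilarity):
    """Hypothetical illustration, NOT the metric from the paper.

    Pairs of the same true type earn full credit only if clustered together.
    Pairs of different true types earn full credit if separated, and partial
    credit (1 - dissimilarity) if merged, so merging closely related types
    is penalized less than merging distant ones."""
    pairs = list(combinations(range(len(truth)), 2))
    credit = 0.0
    for i, j in pairs:
        together = pred[i] == pred[j]
        if truth[i] == truth[j]:
            credit += 1.0 if together else 0.0
        elif together:
            credit += 1.0 - dissimilarity[frozenset((truth[i], truth[j]))]
        else:
            credit += 1.0
    return credit / len(pairs)


# Toy data: four cells of each true type.
truth = ["CD4_T"] * 4 + ["CD8_T"] * 4 + ["B"] * 4
merge_related = [0] * 8 + [1] * 4  # merges CD4 and CD8 T cells (mild error)
merge_distant = [0] * 4 + [1] * 8  # merges CD8 T cells with B cells (severe error)

for name, pred in [("merge_related", merge_related), ("merge_distant", merge_distant)]:
    flat = rand_index(truth, pred)
    weighted = hierarchy_weighted_score(truth, pred, TYPE_DISSIMILARITY)
    print(f"{name}: flat={flat:.3f} weighted={weighted:.3f}")
```

In this toy setup the flat Rand index gives both clusterings the same score, even though merging two T cell subtypes is a milder error than merging T cells with B cells; the hypothetical weighted score ranks the first clustering higher.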

Keywords: Clustering; Gene expression; Single cell RNA-seq.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Cluster Analysis*
  • Humans
  • Models, Genetic*
  • Sequence Analysis, RNA*
  • Single-Cell Analysis*