A review and performance evaluation of clustering frameworks for single-cell Hi-C data

Brief Bioinform. 2022 Nov 19;23(6):bbac385. doi: 10.1093/bib/bbac385.

Abstract

The three-dimensional genome structure plays a key role in cellular function and gene regulation. Single-cell Hi-C (high-throughput chromosome conformation capture) technology captures genome structure information at the level of individual cells, which provides the opportunity to study how genome structure varies among different cell types. Recently, a few methods have been designed specifically for single-cell Hi-C clustering. In this manuscript, we perform an in-depth benchmark study of the available single-cell Hi-C clustering methods and implement an evaluation system for multiple clustering frameworks based on both human and mouse datasets. We compare eight methods in terms of visualization and clustering performance. Performance is evaluated using four benchmark metrics: adjusted Rand index, normalized mutual information, homogeneity and Fowlkes-Mallows index. Furthermore, we evaluate the eight methods on the task of separating cells at different stages of the cell cycle based on single-cell Hi-C data.
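
To make the four benchmark metrics concrete, the following is a minimal standard-library sketch of how each can be computed from a vector of known cell-type labels and a vector of predicted cluster labels. It is an illustration only, not the evaluation code used in the study. The function names and the toy labels are hypothetical, and normalizing mutual information by the arithmetic mean of the two entropies is an assumption (the abstract does not state which normalization was used). Equivalent functions exist in scikit-learn's `sklearn.metrics` module (`adjusted_rand_score`, `normalized_mutual_info_score`, `homogeneity_score`, `fowlkes_mallows_score`).

```python
from collections import Counter
from math import comb, log, sqrt


def pair_counts(truth, pred):
    """Count cell pairs grouped together in both labelings, in truth, in pred, and overall."""
    joint = Counter(zip(truth, pred))
    same_both = sum(comb(c, 2) for c in joint.values())
    same_truth = sum(comb(c, 2) for c in Counter(truth).values())
    same_pred = sum(comb(c, 2) for c in Counter(pred).values())
    return same_both, same_truth, same_pred, comb(len(truth), 2)


def adjusted_rand_index(truth, pred):
    """Pair agreement corrected for chance; assumes at least two cells."""
    both, t, p, total = pair_counts(truth, pred)
    expected = t * p / total
    max_index = (t + p) / 2
    if max_index == expected:  # degenerate case, e.g. both labelings trivial
        return 1.0
    return (both - expected) / (max_index - expected)


def fowlkes_mallows_index(truth, pred):
    """Geometric mean of pairwise precision and recall."""
    both, t, p, _ = pair_counts(truth, pred)
    return both / sqrt(t * p) if t and p else 0.0


def entropy(labels):
    """Shannon entropy (natural log) of a labeling."""
    n = len(labels)
    return -sum(c / n * log(c / n) for c in Counter(labels).values())


def mutual_information(truth, pred):
    """Mutual information (natural log) between two labelings."""
    n = len(truth)
    joint = Counter(zip(truth, pred))
    ct, cp = Counter(truth), Counter(pred)
    return sum(c / n * log(c * n / (ct[a] * cp[b])) for (a, b), c in joint.items())


def normalized_mutual_information(truth, pred):
    """MI divided by the mean entropy (arithmetic-mean normalization is an assumption)."""
    denom = (entropy(truth) + entropy(pred)) / 2
    return mutual_information(truth, pred) / denom if denom > 0 else 1.0


def homogeneity(truth, pred):
    """1 - H(truth | pred) / H(truth): high when each cluster holds a single true class."""
    h_truth = entropy(truth)
    return mutual_information(truth, pred) / h_truth if h_truth > 0 else 1.0


if __name__ == "__main__":
    # Hypothetical toy labels: six cells, two true cell types, three predicted clusters.
    cell_types = ["A", "A", "A", "B", "B", "B"]
    clusters = [0, 0, 1, 2, 2, 2]
    for name, fn in [
        ("ARI", adjusted_rand_index),
        ("NMI", normalized_mutual_information),
        ("Homogeneity", homogeneity),
        ("FMI", fowlkes_mallows_index),
    ]:
        print(f"{name}: {fn(cell_types, clusters):.3f}")
```

All four scores are invariant to how the clusters are numbered. Homogeneity alone does not penalize over-splitting (placing every cell in its own cluster still scores 1), which is one reason to report it alongside the pair-based indices.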

Keywords: clustering; feature extraction; single-cell Hi-C.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Chromatin*
  • Chromosomes*
  • Cluster Analysis
  • Genome
  • Humans
  • Mice
  • Molecular Conformation

Substances

  • Chromatin