Intricacies of single-cell multi-omics data integration

Trends Genet. 2022 Feb;38(2):128-139. doi: 10.1016/j.tig.2021.08.012. Epub 2021 Sep 21.

Abstract

A wealth of single-cell protocols makes it possible to characterize different molecular layers at unprecedented resolution. Integrating the resulting multimodal single-cell data to find cell-to-cell correspondences remains a challenge. We argue that data integration needs to happen at a meaningful biological level of abstraction and that it is necessary to consider the inherent discrepancies between modalities to strike a balance between biological discovery and noise removal. A survey of current methods reveals that a distinction between technical and biological origins of presumed unwanted variation between datasets is not yet commonly considered. The increasing availability of paired multimodal data will aid the development of improved methods by providing a ground truth on cell-to-cell matches.

Keywords: cell type identity; method development; multi-omics data integration; multimodal data integration; single-cell multi-omic assays; single-cell omics.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't