hECA: The cell-centric assembly of a cell atlas

iScience. 2022 Apr 28;25(5):104318. doi: 10.1016/j.isci.2022.104318. eCollection 2022 May 20.

Abstract

The accumulation of massive single-cell omics data provides growing resources for building biomolecular atlases of all cells of human organs or the whole body. The true assembly of a cell atlas should be cell-centric rather than file-centric. We developed a unified informatics framework for seamless cell-centric data assembly and built the human Ensemble Cell Atlas (hECA) from scattered data. hECA v1.0 assembled 1,093,299 labeled human cells from 116 published datasets, covering 38 organs and 11 systems. We invented three new methods of atlas applications based on the cell-centric assembly: "in data" cell sorting for targeted data retrieval with customizable logic expressions, "quantitative portraiture" for multi-view representations of biological entities, and customizable reference creation for generating references for automatic annotations. Case studies on agile construction of user-defined sub-atlases and "in data" investigation of CAR-T off-targets in multiple organs showed the great potential enabled by the cell-centric ensemble atlas.

Keywords: Bioinformatics; Cell biology; Stem cells research.