Iterative point set registration for aligning scRNA-seq data

PLoS Comput Biol. 2020 Oct 27;16(10):e1007939. doi: 10.1371/journal.pcbi.1007939. eCollection 2020 Oct.

Abstract

Several studies profile similar single cell RNA-Seq (scRNA-Seq) data using different technologies and platforms. A number of alignment methods have been developed to enable the integration and comparison of scRNA-Seq data from such studies. While each performs well on some of the datasets, to date no method was able to both perform the alignment using the original expression space and generalize to new data. To enable such analysis we developed Single Cell Iterative Point set Registration (SCIPR) which extends methods that were successfully applied to align image data to scRNA-Seq. We discuss the required changes needed, the resulting optimization function, and algorithms for learning a transformation function for aligning data. We tested SCIPR on several scRNA-Seq datasets. As we show it successfully aligns data from several different cell types, improving upon prior methods proposed for this task. In addition, we show the parameters learned by SCIPR can be used to align data not used in the training and to identify key cell type-specific genes.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Cell Line
  • Computational Biology
  • Gene Expression Profiling / methods*
  • Humans
  • RNA, Small Cytoplasmic / genetics*
  • Sequence Alignment / methods*
  • Sequence Analysis, RNA / methods*
  • Single-Cell Analysis / methods*

Substances

  • RNA, Small Cytoplasmic