Sequencing dropout-and-batch effect normalization for single-cell mRNA profiles: a survey and comparative analysis

Brief Bioinform. 2021 Jul 20;22(4):bbaa248. doi: 10.1093/bib/bbaa248.

Abstract

Single-cell mRNA sequencing has been adopted as a powerful technique for understanding gene expression profiles at the single-cell level. However, challenges remain due to factors such as the inefficiency of mRNA molecular capture, technical noises and separate sequencing of cells in different batches. Normalization methods have been developed to ensure a relatively accurate analysis. This work presents a survey on 10 tools specifically designed for single-cell mRNA sequencing data preprocessing steps, among which 6 tools are used for dropout normalization and 4 tools are for batch effect correction. In this survey, we outline the main methodology for each of these tools, and we also compare these tools to evaluate their normalization performance on datasets which are simulated under the constraints of dropout inefficiency, batch effect or their combined effects. We found that Saver and Baynorm performed better than other methods in dropout normalization, in most cases. Beer and Batchelor performed better in the batch effect normalization, and the Saver-Beer tool combination and the Baynorm-Beer combination performed better in the mixed dropout-and-batch effect normalization. Over-normalization is a common issue occurred to these dropout normalization tools that is worth of future investigation. For the batch normalization tools, the capability of retaining heterogeneity between different groups of cells after normalization can be another direction for future improvement.

Keywords: batch effect; dropout; normalization tools; performance evaluation; single-cell mRNA sequencing.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression Profiling*
  • Oligonucleotide Array Sequence Analysis*
  • RNA, Messenger* / biosynthesis
  • RNA, Messenger* / genetics
  • Single-Cell Analysis*
  • Software*
  • Transcriptome*

Substances

  • RNA, Messenger