A multicenter study benchmarking single-cell RNA sequencing technologies using reference samples

Wanqiu Chen; Yongmei Zhao; Xin Chen; Zhaowei Yang; Xiaojiang Xu; Yingtao Bi; Vicky Chen; Jing Li; Hannah Choi; Ben Ernest; Bao Tran; Monika Mehta; Parimal Kumar; Andrew Farmer; Alain Mir; Urvashi Ann Mehra; Jian-Liang Li; Malcolm Moos Jr; Wenming Xiao; Charles Wang

doi:10.1038/s41587-020-00748-9

Nat Biotechnol. 2021 Sep;39(9):1103-1114. doi: 10.1038/s41587-020-00748-9. Epub 2020 Dec 21.

Authors

Wanqiu Chen^#¹, Yongmei Zhao^#^{2,3}, Xin Chen^#^{1,4}, Zhaowei Yang^#^{1,5}, Xiaojiang Xu⁶, Yingtao Bi⁷, Vicky Chen^{2,3}, Jing Li^{4,5}, Hannah Choi¹, Ben Ernest⁸, Bao Tran³, Monika Mehta³, Parimal Kumar³, Andrew Farmer⁹, Alain Mir⁹, Urvashi Ann Mehra⁸, Jian-Liang Li⁶, Malcolm Moos Jr¹⁰, Wenming Xiao¹¹, Charles Wang^{12,13}

Affiliations

¹ Center for Genomics, School of Medicine, Loma Linda University, Loma Linda, CA, USA.
² CCR-SF Bioinformatics Group, Advanced Biomedical and Computational Sciences, Biomedical Informatics and Data Science Directorate, Frederick National Laboratory for Cancer Research, Frederick, MD, USA.
³ Sequencing Facility, Frederick National Laboratory for Cancer Research, Frederick, MD, USA.
⁴ Department of Basic Sciences, School of Medicine, Loma Linda University, Loma Linda, CA, USA.
⁵ Department of Allergy and Clinical Immunology, State Key Laboratory of Respiratory Disease, Guangzhou Institute of Respiratory Health, the First Affiliated Hospital of Guangzhou Medical University, Guangzhou, People's Republic of China.
⁶ Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences, Research Triangle Park, NC, USA.
⁷ Abbvie Cambridge Research Center, Cambridge, MA, USA.
⁸ Digicon Corporation, McLean, VA, USA.
⁹ Takara Bio USA, Inc., Mountain View, CA, USA.
¹⁰ Center for Biologics Evaluation and Research & Division of Cellular and Gene Therapies, U.S. Food and Drug Administration, Silver Spring, MD, USA.
¹¹ The Center for Devices and Radiological Health, U.S. Food and Drug Administration, Silver Spring, MD, USA. wenming.xiao@fda.hhs.gov.
¹² Center for Genomics, School of Medicine, Loma Linda University, Loma Linda, CA, USA. oxwang@gmail.com.
¹³ Department of Basic Sciences, School of Medicine, Loma Linda University, Loma Linda, CA, USA. oxwang@gmail.com.

^# Contributed equally.

PMID: 33349700
DOI: 10.1038/s41587-020-00748-9

Abstract

Comparing diverse single-cell RNA sequencing (scRNA-seq) datasets generated by different technologies and in different laboratories remains a major challenge. Here we address the need for guidance in choosing algorithms leading to accurate biological interpretations of varied data types acquired with different platforms. Using two well-characterized cellular reference samples (breast cancer cells and B cells), captured either separately or in mixtures, we compared different scRNA-seq platforms and several preprocessing, normalization and batch-effect correction methods at multiple centers. Although preprocessing and normalization contributed to variability in gene detection and cell classification, batch-effect correction was by far the most important factor in correctly classifying the cells. Moreover, scRNA-seq dataset characteristics (for example, sample and cellular heterogeneity and platform used) were critical in determining the optimal bioinformatic method. However, reproducibility across centers and platforms was high when appropriate bioinformatic methods were applied. Our findings offer practical guidance for optimizing platform and software selection when designing an scRNA-seq study.

Publication types

Multicenter Study
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
B-Lymphocytes
Benchmarking*
Breast Neoplasms
Cell Line, Tumor
Datasets as Topic
Female
Gene Expression Profiling / methods
Gene Expression Profiling / standards
Humans
Sequence Analysis, RNA / methods
Sequence Analysis, RNA / standards*
Single-Cell Analysis / methods
Single-Cell Analysis / standards*
