A single cell RNAseq benchmark experiment embedding "controlled" cancer heterogeneity

Sci Data. 2024 Feb 2;11(1):159. doi: 10.1038/s41597-024-03002-y.

Abstract

Single-cell RNA sequencing (scRNA-seq) has emerged as a vital tool in tumour research, enabling the exploration of molecular complexities at the individual cell level. It offers new technical possibilities for advancing tumour research with the potential to yield significant breakthroughs. However, deciphering meaningful insights from scRNA-seq data poses challenges, particularly in cell annotation and tumour subpopulation identification. Efficient algorithms are therefore needed to unravel the intricate biological processes of cancer. To address these challenges, benchmarking datasets are essential to validate bioinformatics methodologies for analysing single-cell omics in oncology. Here, we present a 10XGenomics scRNA-seq experiment, providing a controlled heterogeneous environment using lung cancer cell lines characterised by the expression of seven different driver genes (EGFR, ALK, MET, ERBB2, KRAS, BRAF, ROS1), leading to partially overlapping functional pathways. Our dataset provides a comprehensive framework for the development and validation of methodologies for analysing cancer heterogeneity by means of scRNA-seq.

Publication types

  • Dataset

MeSH terms

  • Algorithms
  • Benchmarking*
  • Cell Line, Tumor
  • Gene Expression Profiling / methods
  • Humans
  • Lung Neoplasms* / genetics
  • Proto-Oncogene Proteins / genetics
  • Sequence Analysis, RNA / methods
  • Single-Cell Gene Expression Analysis

Substances

  • Proto-Oncogene Proteins