Accurate estimation of cell-type composition from gene expression data

Nat Commun. 2019 Jul 5;10(1):2975. doi: 10.1038/s41467-019-10802-z.

Abstract

The rapid development of single-cell transcriptomic technologies has helped uncover the cellular heterogeneity within cell populations. However, bulk RNA-seq continues to be the main workhorse for quantifying gene expression levels due to technical simplicity and low cost. To most effectively extract information from bulk data given the new knowledge gained from single-cell methods, we have developed a novel algorithm to estimate the cell-type composition of bulk data from a single-cell RNA-seq-derived cell-type signature. Comparison with existing methods using various real RNA-seq data sets indicates that our new approach is more accurate and comprehensive than previous methods, especially for the estimation of rare cell types. More importantly, our method can detect cell-type composition changes in response to external perturbations, thereby providing a valuable, cost-effective method for dissecting the cell-type-specific effects of drug treatments or condition changes. As such, our method is applicable to a wide range of biological and clinical investigations.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms
  • Datasets as Topic
  • Feasibility Studies
  • Female
  • Gene Expression Profiling / methods*
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Least-Squares Analysis
  • Melanoma / genetics
  • Ovarian Neoplasms / genetics
  • Sequence Analysis, RNA / methods*
  • Single-Cell Analysis / methods*
  • Skin Neoplasms / genetics
  • Transcriptome / genetics