qSVA framework for RNA quality correction in differential expression analysis

Proc Natl Acad Sci U S A. 2017 Jul 3;114(27):7130-7135. doi: 10.1073/pnas.1617384114. Epub 2017 Jun 20.

Abstract

RNA sequencing (RNA-seq) is a powerful approach for measuring gene expression levels in cells and tissues, but it relies on high-quality RNA. We demonstrate here that statistical adjustment using existing quality measures largely fails to remove the effects of RNA degradation when RNA quality associates with the outcome of interest. Using RNA-seq data from molecular degradation experiments of human primary tissues, we introduce a method-quality surrogate variable analysis (qSVA)-as a framework for estimating and removing the confounding effect of RNA quality in differential expression analysis. We show that this approach results in greatly improved replication rates (>3×) across two large independent postmortem human brain studies of schizophrenia and also removes potential RNA quality biases in earlier published work that compared expression levels of different brain regions and other diagnostic groups. Our approach can therefore improve the interpretation of differential expression analysis of transcriptomic data from human tissue.

Keywords: RNA quality; RNA sequencing; differential expression analysis; statistical modeling.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Computational Biology
  • DNA Replication
  • Gene Expression Profiling
  • Gene Expression Regulation
  • Genotype
  • Gray Matter
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Oligonucleotide Array Sequence Analysis
  • RNA / analysis*
  • RNA / genetics
  • Schizophrenia / genetics
  • Schizophrenia / metabolism
  • Sequence Analysis, RNA / methods*
  • Transcriptome

Substances

  • RNA