MMBGX: a method for estimating expression at the isoform level and detecting differential splicing using whole-transcript Affymetrix arrays

Nucleic Acids Res. 2010 Jan;38(1):e4. doi: 10.1093/nar/gkp853. Epub 2009 Oct 23.

Abstract

Affymetrix has recently developed whole-transcript GeneChips-'Gene' and 'Exon' arrays-which interrogate exons along the length of each gene. Although each probe on these arrays is intended to hybridize perfectly to only one transcriptional target, many probes match multiple transcripts located in different parts of the genome or alternative isoforms of the same gene. Existing statistical methods for estimating expression do not take this into account and are thus prone to producing inflated estimates. We propose a method, Multi-Mapping Bayesian Gene eXpression (MMBGX), which disaggregates the signal at 'multi-match' probes. When applied to Gene arrays, MMBGX removes the upward bias of gene-level expression estimates. When applied to Exon arrays, it can further disaggregate the signal between alternative transcripts of the same gene, providing expression estimates of individual splice variants. We demonstrate the performance of MMBGX on simulated data and a tissue mixture data set. We then show that MMBGX can estimate the expression of alternative isoforms within one experimental condition, confirming our results by RT-PCR. Finally, we show that our method for detecting differential splicing has a lower error rate than standard exon-level approaches on a previously validated colon cancer data set.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Animals
  • Bayes Theorem
  • Colonic Neoplasms / genetics
  • Colonic Neoplasms / metabolism
  • Exons
  • Gene Expression Profiling / methods*
  • Humans
  • Male
  • Mice
  • Mice, Inbred C57BL
  • Oligonucleotide Array Sequence Analysis / methods*
  • Protein Isoforms / genetics*
  • Protein Isoforms / metabolism
  • Reverse Transcriptase Polymerase Chain Reaction
  • Software

Substances

  • Protein Isoforms