An integrated approach for the analysis of biological pathways using mixed models

PLoS Genet. 2008 Jul;4(7):e1000115. doi: 10.1371/journal.pgen.1000115. Epub 2008 Jul 4.

Abstract

Gene class, ontology, or pathway testing analysis has become increasingly popular in microarray data analysis. Such approaches allow the integration of gene annotation databases, such as Gene Ontology and KEGG Pathway, to formally test for subtle but coordinated changes at a system level. Higher power in gene class testing is gained by combining weak signals from a number of individual genes in each pathway. We propose an alternative approach for gene-class testing based on mixed models, a class of statistical models that: a) provides the ability to model and borrow strength across genes that are both up and down in a pathway, b) operates within a well-established statistical framework amenable to direct control of false positive or false discovery rates, c) exhibits improved power over widely used methods under normal location-based alternative hypotheses, and d) handles complex experimental designs for which permutation resampling is difficult. We compare the properties of this mixed models approach with nonparametric method GSEA and parametric method PAGE using a simulation study, and illustrate its application with a diabetes data set and a dose-response data set.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aldehydes / administration & dosage
  • Area Under Curve
  • Cell Line, Tumor
  • Databases, Genetic
  • Diabetes Mellitus / genetics
  • Dose-Response Relationship, Drug
  • Gene Expression / drug effects
  • Gene Expression Profiling / statistics & numerical data
  • Humans
  • Linear Models
  • Models, Genetic*
  • Models, Statistical
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data*
  • Sensitivity and Specificity
  • Systems Biology

Substances

  • Aldehydes
  • 4-hydroxy-2-nonenal