Integrative modeling of multi-platform genomic data under the framework of mediation analysis

Stat Med. 2015 Jan 15;34(1):162-78. doi: 10.1002/sim.6326. Epub 2014 Oct 15.

Abstract

Given the availability of genomic data, there have been emerging interests in integrating multi-platform data. Here, we propose to model genetics (single nucleotide polymorphism (SNP)), epigenetics (DNA methylation), and gene expression data as a biological process to delineate phenotypic traits under the framework of causal mediation modeling. We propose a regression model for the joint effect of SNPs, methylation, gene expression, and their nonlinear interactions on the outcome and develop a variance component score test for any arbitrary set of regression coefficients. The test statistic under the null follows a mixture of chi-square distributions, which can be approximated using a characteristic function inversion method or a perturbation procedure. We construct tests for candidate models determined by different combinations of SNPs, DNA methylation, gene expression, and interactions and further propose an omnibus test to accommodate different models. We then study three path-specific effects: the direct effect of SNPs on the outcome, the effect mediated through expression, and the effect through methylation. We characterize correspondences between the three path-specific effects and coefficients in the regression model, which are influenced by causal relations among SNPs, DNA methylation, and gene expression. We illustrate the utility of our method in two genomic studies and numerical simulation studies.

Keywords: SNP-set analysis; causal inference; epigenetics; genetic association studies; path-specific effect; variance component test.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Asthma / genetics*
  • Computer Simulation
  • DNA Methylation
  • Epigenomics / methods
  • Epigenomics / statistics & numerical data*
  • GRB10 Adaptor Protein / genetics*
  • Gene Expression
  • Genome, Human
  • Genome-Wide Association Study
  • Glioblastoma / genetics*
  • Glioblastoma / mortality
  • Humans
  • Membrane Proteins / genetics*
  • Models, Genetic
  • Polymorphism, Single Nucleotide
  • Regression Analysis
  • Survival Analysis

Substances

  • GRB10 protein, human
  • Membrane Proteins
  • ORMDL3 protein, human
  • GRB10 Adaptor Protein