A Novel Pathway-Based Approach Improves Lung Cancer Risk Prediction Using Germline Genetic Variations

Cancer Epidemiol Biomarkers Prev. 2016 Aug;25(8):1208-15. doi: 10.1158/1055-9965.EPI-15-1318. Epub 2016 May 24.

Abstract

Background: Although genome-wide association studies (GWAS) have identified many genetic variants that are strongly associated with lung cancer, these variants have low penetrance and serve as poor predictors of lung cancer in individuals. We sought to increase the predictive value of germline variants by considering their cumulative effects in the context of biologic pathways.

Methods: For individuals in the Environment and Genetics in Lung Cancer Etiology study (1,815 cases/1,971 controls), we computed pathway-level susceptibility effects as the sum of relevant SNP variant alleles weighted by their log-additive effects from a separate lung cancer GWAS meta-analysis (7,766 cases/37,482 controls). Logistic regression models based on age, sex, smoking, genetic variants, and principal components of pathway effects and pathway-smoking interactions were trained and optimized in cross-validation and further tested on an independent dataset (556 cases/830 controls). We assessed prediction performance using area under the receiver operating characteristic curve (AUC).
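The pathway-level susceptibility effect described in the Methods can be illustrated with a minimal sketch, assuming each SNP is coded as a variant allele count (0, 1, or 2) and each weight is a per-allele log odds ratio. The function name `pathway_scores`, the SNP identifiers, the weights, and the pathway memberships below are all hypothetical and chosen only for illustration; they are not taken from the study. Skipping SNPs with no genotype or no weight is also an assumption of this sketch.

```python
from typing import Dict, List


def pathway_scores(
    genotypes: Dict[str, int],
    log_odds: Dict[str, float],
    pathways: Dict[str, List[str]],
) -> Dict[str, float]:
    """Return one weighted allele sum per pathway for a single individual.

    genotypes: SNP id -> variant allele count (0, 1, or 2)
    log_odds:  SNP id -> per-allele log odds ratio from an external GWAS
    pathways:  pathway name -> list of SNP ids assigned to that pathway
    """
    scores = {}
    for name, snps in pathways.items():
        # Assumption: SNPs missing a genotype or a weight are skipped.
        scores[name] = sum(
            log_odds[snp] * genotypes[snp]
            for snp in snps
            if snp in genotypes and snp in log_odds
        )
    return scores


# Hypothetical toy data, not values from the study.
example_genotypes = {"rs_a": 2, "rs_b": 1, "rs_c": 0}
example_log_odds = {"rs_a": 0.10, "rs_b": -0.05, "rs_c": 0.20}
example_pathways = {
    "pathway_1": ["rs_a", "rs_b"],
    "pathway_2": ["rs_b", "rs_c"],
}

example_scores = pathway_scores(
    example_genotypes, example_log_odds, example_pathways
)
print(example_scores)
```

In a full analysis, scores like these would be computed for every individual and then summarized (for example by principal components) before entering a regression model; that step is not shown here.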

Results: Compared with typical binomial prediction models based on epidemiologic predictors alone (AUC = 0.607) or on epidemiologic predictors plus top GWAS variants (AUC = 0.617), our pathway-based, smoking-interactive multinomial model significantly improved prediction performance in external validation (AUC = 0.656, P < 0.0001).
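As a reminder of what the AUC values compared here measure, the following minimal sketch computes AUC as the probability that a randomly chosen case receives a higher predicted risk than a randomly chosen control, counting ties as one half. The function name `auc` and the toy labels and scores are hypothetical; this is a generic illustration of the metric, not the evaluation code used in the study.

```python
from typing import Sequence


def auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Pairwise (Mann-Whitney) estimate of the area under the ROC curve.

    labels: 1 for a case, 0 for a control
    scores: predicted risk, higher meaning more likely to be a case
    """
    case_scores = [s for y, s in zip(labels, scores) if y == 1]
    control_scores = [s for y, s in zip(labels, scores) if y == 0]
    if not case_scores or not control_scores:
        raise ValueError("need at least one case and one control")

    wins = 0.0
    for c in case_scores:
        for k in control_scores:
            if c > k:
                wins += 1.0   # case ranked above control
            elif c == k:
                wins += 0.5   # ties count as half
    return wins / (len(case_scores) * len(control_scores))


# Hypothetical toy data: two cases and two controls.
auc_example = auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
print(auc_example)
```

An AUC of 0.5 corresponds to ranking cases and controls no better than chance, and 1.0 to perfect separation.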

Conclusions: Our biologically informed approach yielded a larger increase in AUC over its nongenetic counterpart models than previous approaches that incorporate genetic variants.

Impact: This model is the first of its kind to evaluate lung cancer prediction using subtype-stratified genetic effects that are organized into pathways and modeled in interaction with smoking. We propose pathway-exposure interactions as a potentially powerful new contributor to risk inference. ©2016 AACR.

MeSH terms

  • Aged
  • Aged, 80 and over
  • Case-Control Studies
  • Female
  • Genetic Predisposition to Disease*
  • Genetic Variation*
  • Humans
  • Lung Neoplasms / genetics*
  • Male
  • Middle Aged
  • Odds Ratio
  • Polymorphism, Single Nucleotide
  • ROC Curve
  • Risk Factors
  • Smoking / epidemiology*