Decision tree-based modeling of androgen pathway genes and prostate cancer risk

Cancer Epidemiol Biomarkers Prev. 2011 Jun;20(6):1146-55. doi: 10.1158/1055-9965.EPI-10-0996. Epub 2011 Apr 14.

Abstract

Background: Inherited variability in genes that influence androgen metabolism has been associated with risk of prostate cancer. The objective of this analysis was to evaluate interactions for prostate cancer risk by using classification and regression tree (CART) models (i.e., decision trees), and to evaluate whether these interactive effects add information about prostate cancer risk prediction beyond that of "traditional" risk factors.

Methods: We compared CART models with traditional logistic regression (LR) models for associations of factors with prostate cancer risk using 1,084 prostate cancer cases and 941 controls. All analyses were stratified by race. We used unconditional LR to complement and compare with the race-stratified CART results using the area under curve (AUC) for the receiver operating characteristic curves.

Results: The CART modeling of prostate cancer risk showed different interaction profiles by race. For European Americans, interactions among CYP3A43 genotype, history of benign prostate hypertrophy, family history of prostate cancer, and age at consent revealed a distinct hierarchy of gene-environment and gene-gene interactions, whereas for African Americans, interactions among family history of prostate cancer, individual proportion of European ancestry, number of GGC androgen receptor repeats, and CYP3A4/CYP3A5 haplotype revealed distinct interaction effects from those found in European Americans. For European Americans, the CART model had the highest AUC whereas for African Americans, the LR model with the CART discovered factors had the largest AUC.

Conclusion and impact: These results provide new insight into underlying prostate cancer biology for European Americans and African Americans.

Publication types

  • Comparative Study
  • Multicenter Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3-Oxo-5-alpha-Steroid 4-Dehydrogenase / genetics
  • Adult
  • Aged
  • Aryl Hydrocarbon Hydroxylases / genetics
  • Case-Control Studies
  • Cytochrome P-450 CYP3A / genetics
  • DNA / genetics
  • Decision Support Techniques*
  • Decision Trees*
  • Haplotypes / genetics
  • Humans
  • Male
  • Membrane Proteins / genetics
  • Middle Aged
  • Neoplasm Staging
  • Polymerase Chain Reaction
  • Polymorphism, Restriction Fragment Length
  • Polymorphism, Single Nucleotide / genetics*
  • Prognosis
  • Prostate / metabolism*
  • Prostatic Neoplasms / genetics*
  • ROC Curve
  • Receptors, Androgen / genetics*
  • Repetitive Sequences, Nucleic Acid
  • Risk Factors

Substances

  • AR protein, human
  • Membrane Proteins
  • Receptors, Androgen
  • DNA
  • Aryl Hydrocarbon Hydroxylases
  • CYP3A43 protein, human
  • CYP3A5 protein, human
  • Cytochrome P-450 CYP3A
  • CYP3A4 protein, human
  • 3-Oxo-5-alpha-Steroid 4-Dehydrogenase
  • SRD5A2 protein, human