Exploratory Bayesian model selection for serial genetics data

Biometrics. 2005 Jun;61(2):591-9. doi: 10.1111/j.1541-0420.2005.040417.x.

Abstract

Characterizing the process by which molecular and cellular level changes occur over time will have broad implications for clinical decision making and help further our knowledge of disease etiology across many complex diseases. However, this presents an analytic challenge due to the large number of potentially relevant biomarkers and the complex, uncharacterized relationships among them. We propose an exploratory Bayesian model selection procedure that searches for model simplicity through independence testing of multiple discrete biomarkers measured over time. Bayes factor calculations are used to identify and compare models that are best supported by the data. For large model spaces, i.e., a large number of multi-leveled biomarkers, we propose a Markov chain Monte Carlo (MCMC) stochastic search algorithm for finding promising models. We apply our procedure to explore the extent to which HIV-1 genetic changes occur independently over time.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Biomarkers
  • DNA Mutational Analysis
  • HIV / genetics*
  • HIV Infections / virology
  • HIV Protease / genetics
  • Humans
  • Markov Chains
  • Models, Genetic*
  • Models, Statistical
  • Monte Carlo Method
  • Mutation
  • Selection, Genetic

Substances

  • Biomarkers
  • HIV Protease