The analysis of disease biomarker data using a mixed hidden Markov model (Open Access publication)

Genet Sel Evol. 2008 Sep-Oct;40(5):491-509. doi: 10.1186/1297-9686-40-5-491. Epub 2008 Aug 12.

Abstract

A mixed hidden Markov model (HMM) was developed for predicting breeding values of a biomarker (here, somatic cell score) and the individual probabilities of health and disease (here, mastitis) based upon the measurements of the biomarker. At a first level, the unobserved disease process (Markov model) was introduced and at a second level, the measurement process was modeled, making the link between the unobserved disease states and the observed biomarker values. This hierarchical formulation allows joint estimation of the parameters of both processes. The flexibility of this approach is illustrated on the simulated data. Firstly, lactation curves for the biomarker were generated based upon published parameters (mean, variance, and probabilities of infection) for cows with known clinical conditions (health or mastitis due to Escherichia coli or Staphylococcus aureus). Next, estimation of the parameters was performed via Gibbs sampling, assuming the health status was unknown. Results from the simulations and mathematics show that the mixed HMM is appropriate to estimate the quantities of interest although the accuracy of the estimates is moderate when the prevalence of the disease is low. The paper ends with some indications for further developments of the methodology.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Biomarkers / analysis*
  • Cattle / genetics*
  • Computer Simulation
  • Dairying
  • Escherichia coli Infections / genetics
  • Female
  • Genetic Predisposition to Disease
  • Lactation / genetics
  • Markov Chains*
  • Mastitis, Bovine / genetics*
  • Models, Theoretical
  • Staphylococcal Infections / genetics
  • Staphylococcus aureus / physiology

Substances

  • Biomarkers