Haplotyping for disease association: a combinatorial approach

IEEE/ACM Trans Comput Biol Bioinform. 2008 Apr-Jun;5(2):245-51. doi: 10.1109/TCBB.2007.70255.

Abstract

We consider a combinatorial problem derived from haplotyping a population with respect to a genetic disease, either recessive or dominant. Given a set of individuals, partitioned into healthy and diseased, and the corresponding sets of genotypes, we want to infer "bad'' and "good'' haplotypes to account for these genotypes and for the disease. Assume e.g. the disease is recessive. Then, the resolving haplotypes must consist of bad and good haplotypes, so that (i) each genotype belonging to a diseased individual is explained by a pair of bad haplotypes and (ii) each genotype belonging to a healthy individual is explained by a pair of haplotypes of which at least one is good. We prove that the associated decision problem is NP-complete. However, we also prove that there is a simple solution, provided the data satisfy a very weak requirement.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computational Biology
  • Female
  • Genetic Diseases, Inborn / genetics*
  • Genetic Predisposition to Disease
  • Genotype
  • Haplotypes / genetics*
  • Humans
  • Male
  • Mathematics
  • Models, Genetic*
  • Polymorphism, Single Nucleotide