Family trio phasing and missing data recovery

Int J Bioinform Res Appl. 2005;1(2):221-9. doi: 10.1504/IJBRA.2005.007580.

Abstract

Although there exist many phasing methods for unrelated adults or pedigrees, phasing and missing data recovery for data representing family trios is lagging behind. This paper is an attempt to fill this gap by considering the following problem. Given a set of genotypes partitioned into family trios, find for each trio a quartet of parent/offspring haplotypes explaining each trio without recombinations and recovering the SNP values missed in given genotype data. Our contributions include: formulating the pure-parsimony trio phasing without recombinations and the trio missing data recovery problems; proposing new greedy and integer linear programming based solution methods; extensive experimental validation of proposed methods showing advantage over the previously known methods.

MeSH terms

  • Genotype*
  • Haplotypes*
  • Humans
  • Models, Genetic
  • Pedigree