Identity-by-descent-based phasing and imputation in founder populations using graphical models

Genet Epidemiol. 2011 Dec;35(8):853-60. doi: 10.1002/gepi.20635. Epub 2011 Oct 17.

Abstract

Accurate knowledge of haplotypes, the combination of alleles co-residing on a single copy of a chromosome, enables powerful gene mapping and sequence imputation methods. Since humans are diploid, haplotypes must be derived from genotypes by a phasing process. In this study, we present a new computational model for haplotype phasing based on pairwise sharing of haplotypes inferred to be Identical-By-Descent (IBD). We apply the Bayesian network based model in a new phasing algorithm, called systematic long-range phasing (SLRP), that can capitalize on the close genetic relationships in isolated founder populations, and show with simulated and real genome-wide genotype data that SLRP substantially reduces the rate of phasing errors compared to previous phasing algorithms. Furthermore, the method accurately identifies regions of IBD, enabling linkage-like studies without pedigrees, and can be used to impute most genotypes with very low error rate.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Computer Graphics
  • Computer Simulation
  • Genetics, Population*
  • Haplotypes
  • Humans
  • Models, Genetic*
  • Polymorphism, Single Nucleotide
  • Scotland