False disease region identification from identity-by-descent haplotype sharing in the presence of phenocopies

Twin Res Hum Genet. 2006 Feb;9(1):9-16. doi: 10.1375/183242706776403000.

Abstract

Linkage analysis (either parametric or nonparametric) is commonly applied to identify chromosomal regions using related individuals affected by disease. In complex disease the incomplete relationship between phenotype and genotype can be modeled using a phenocopy parameter, the probability that an individual is affected given they do not carry the disease mutation of interest, and a nonpenetrance parameter, the probability that an individual is not affected given they do carry the disease mutation of interest. If the linkage phase between multiple markers and a putative disease locus is known, then haplotypes carrying the mutation can, in principle, be identified by comparing the chromosome segments that are shared identical-by-descent (IBD) across affected individuals. We consider here the effect of a nonzero phenocopy rate on the linkage peak and hence upon the identification of disease haplotypes that are shared IBD between affected individuals. We show, by theory and computer simulation, that in diseases for which there is a nonzero phenocopy rate, the chromosomal regions identified may not include the true disease locus. We utilize a LOD-1 confidence interval for a widely used nonparametric linkage statistic. We find that in small/moderate samples this confidence interval may be inappropriate. We give specific examples where the phenocopy rates are nonnegligible in some complex diseases. The success of further work to identify the causal mutations underlying the linkage peaks in these diseases will depend on researchers allowing for the presence of phenocopies by examining appropriately wide regions around the initial positive linkage finding.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Computer Simulation
  • Genetic Linkage*
  • Genetic Markers
  • Genotype
  • Haplotypes / genetics*
  • Humans
  • Inheritance Patterns
  • Lod Score
  • Models, Genetic*
  • Mutation / genetics
  • Phenotype
  • Statistics, Nonparametric

Substances

  • Genetic Markers