Clustering of haplotypes based on phylogeny: how good a strategy for association testing?

Claire Bardel; Pierre Darlu; Emmanuelle Génin

doi:10.1038/sj.ejhg.5201501

Clustering of haplotypes based on phylogeny: how good a strategy for association testing?

Eur J Hum Genet. 2006 Feb;14(2):202-6. doi: 10.1038/sj.ejhg.5201501.

Authors

Claire Bardel¹, Pierre Darlu, Emmanuelle Génin

Affiliation

¹ INSERM U535, Hôpital Paul Brousse, Villejuif, France. bardel@vjf.inserm.fr

PMID: 16306882
DOI: 10.1038/sj.ejhg.5201501

Abstract

Haplotypes are now widely used in association studies between markers and disease susceptibility locus. However, when a large number of markers are considered, the number of possible haplotypes increases leading to two problems: an increased number of degrees of freedom that may result in a lack of power and the existence of rare haplotypes that may be difficult to take into account in the statistical analysis. In a recent paper, Durrant et al proposed a method, CLADHC, to group haplotypes based on distance matrices and showed that this could considerably increase the power of the association test as compared to either single-locus analysis or haplotype analysis without prior grouping. Although the authors considered different one-disease-locus susceptibility models in their simulations, they did not study the impact of the linkage disequilibrium (LD) pattern and of the susceptibility allele frequency on their conclusions. Here, we show, using haplotype data from five regions of the genome of different lengths and with different LD patterns, that, when a single disease susceptibility locus is simulated, the prior grouping of haplotypes based on the algorithm of Durrant et al does not increase the power of association testing except in very particular situations of LD patterns and allele frequencies.

Publication types

Comparative Study

MeSH terms

Computer Simulation
Data Interpretation, Statistical*
Europe
Gene Frequency
Genetic Predisposition to Disease / genetics*
Haplotypes / genetics*
Humans
Linkage Disequilibrium
Models, Genetic*
Phylogeny*