Optimal genotype determination in highly multiplexed SNP data

Eur J Hum Genet. 2006 Feb;14(2):207-15. doi: 10.1038/sj.ejhg.5201528.

Abstract

High-throughput genotyping technologies that enable large association studies are already available. Tools for genotype determination starting from raw signal intensities need to be automated, robust, and flexible to provide optimal genotype determination given the specific requirements of a study. The key metrics describing the performance of a custom genotyping study are assay conversion, call rate, and genotype accuracy. These three metrics can be traded off against each other. Using the highly multiplexed Molecular Inversion Probe technology as an example, we describe a methodology for identifying the optimal trade-off. The methodology comprises: a robust clustering algorithm and assessment of a large number of data filter sets. The clustering algorithm allows for automatic genotype determination. Many different sets of filters are then applied to the clustered data, and performance metrics resulting from each filter set are calculated. These performance metrics relate to the power of a study and provide a framework to choose the most suitable filter set to the particular study.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms*
  • Cluster Analysis
  • Computer Simulation
  • Genetic Markers / genetics
  • Genotype
  • Molecular Probe Techniques*
  • Polymorphism, Single Nucleotide / genetics*
  • Reproducibility of Results
  • Research Design*

Substances

  • Genetic Markers