Efficient exact maximum a posteriori computation for bayesian SNP genotyping in polyploids

PLoS One. 2012;7(2):e30906. doi: 10.1371/journal.pone.0030906. Epub 2012 Feb 17.

Abstract

The problem of genotyping polyploids is extremely important for the creation of genetic maps and assembly of complex plant genomes. Despite its significance, polyploid genotyping still remains largely unsolved and suffers from a lack of statistical formality. In this paper a graphical bayesian model for SNP genotyping data is introduced. This model can infer genotypes even when the ploidy of the population is unknown. We also introduce an algorithm for finding the exact maximum a posteriori genotype configuration with this model. This algorithm is implemented in a freely available web-based software package SuperMASSA. We demonstrate the utility, efficiency, and flexibility of the model and algorithm by applying them to two different platforms, each of which is applied to a polyploid data set: Illumina GoldenGate data from potato and Sequenom MassARRAY data from sugarcane. Our method achieves state-of-the-art performance on both data sets and can be trivially adapted to use models that utilize prior information about any platform or species.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Computational Biology / methods*
  • Databases, Genetic
  • Genetic Loci / genetics
  • Genotyping Techniques / methods*
  • Polymorphism, Single Nucleotide / genetics*
  • Polyploidy*
  • Saccharum / genetics*
  • Software
  • Solanum tuberosum / genetics*
  • Time Factors