Single Nucleotide Polymorphism Identification in Polyploids: A Review, Example, and Recommendations

Mol Plant. 2015 Jun;8(6):831-46. doi: 10.1016/j.molp.2015.02.002. Epub 2015 Feb 10.

Abstract

Understanding the relationship between genotype and phenotype is a major biological question and being able to predict phenotypes based on molecular genotypes is integral to molecular breeding. Whole-genome duplications have shaped the history of all flowering plants and present challenges to elucidating the relationship between genotype and phenotype, especially in neopolyploid species. Although single nucleotide polymorphisms (SNPs) have become popular tools for genetic mapping, discovery and application of SNPs in polyploids has been difficult. Here, we summarize common experimental approaches to SNP calling, highlighting recent polyploid successes. To examine the impact of software choice on these analyses, we called SNPs among five peanut genotypes using different alignment programs (BWA-mem and Bowtie 2) and variant callers (SAMtools, GATK, and Freebayes). Alignments produced by Bowtie 2 and BWA-mem and analyzed in SAMtools shared 24.5% concordant SNPs, and SAMtools, GATK, and Freebayes shared 1.4% concordant SNPs. A subsequent analysis of simulated Brassica napus chromosome 1A and 1C genotypes demonstrated that, of the three software programs, SAMtools performed with the highest sensitivity and specificity on Bowtie 2 alignments. These results, however, are likely to vary among species, and we therefore propose a series of best practices for SNP calling in polyploids.

Keywords: genomics; homeolog; next-generation sequencing (NGS); peanut; polyploid; single nucleotide polymorphism (SNP).

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Arachis / genetics*
  • Brassica napus / genetics*
  • High-Throughput Nucleotide Sequencing
  • Plant Proteins / genetics*
  • Polymorphism, Single Nucleotide*
  • Polyploidy*

Substances

  • Plant Proteins