Development, validation and genetic analysis of a large soybean SNP genotyping array

Plant J. 2015 Feb;81(4):625-36. doi: 10.1111/tpj.12755.

Abstract

Cultivated soybean (Glycine max) suffers from a narrow germplasm relative to other crop species, probably because of under-use of wild soybean (Glycine soja) as a breeding resource. Use of a single nucleotide polymorphism (SNP) genotyping array is a promising method for dissecting cultivated and wild germplasms to identify important adaptive genes through high-density genetic mapping and genome-wide association studies. Here we describe a large soybean SNP array for use in diversity analyses, linkage mapping and genome-wide association analyses. More than four million high-quality SNPs identified from high-depth genome re-sequencing of 16 soybean accessions and low-depth genome re-sequencing of 31 soybean accessions were used to select 180,961 SNPs for creation of the Axiom(®) SoyaSNP array. Validation analysis for a set of 222 diverse soybean lines showed that 170,223 markers were of good quality for genotyping. Phylogenetic and allele frequency analyses of the validation set data indicated that accessions showing an intermediate morphology between cultivated and wild soybeans collected in Korea were natural hybrids. More than 90 unanchored scaffolds in the current soybean reference sequence were assigned to chromosomes using this array. Finally, dense average spacing and preferential distribution of the SNPs in gene-rich chromosomal regions suggest that this array may be suitable for genome-wide association studies of soybean germplasm. Taken together, these results suggest that use of this array may be a powerful method for soybean genetic analyses relating to many aspects of soybean breeding.

Keywords: Glycine max; SNP array; genome-wide association study; genotyping; hybrid; unanchored scaffold; validation.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Genome-Wide Association Study
  • Genotyping Techniques*
  • Glycine max / genetics*
  • Hybridization, Genetic
  • Oligonucleotide Array Sequence Analysis
  • Polymorphism, Single Nucleotide*