Genetic variation among 481 diverse soybean accessions, inferred from genomic re-sequencing

Sci Data. 2021 Feb 8;8(1):50. doi: 10.1038/s41597-021-00834-w.

Abstract

We report characteristics of soybean genetic diversity and structure from the resequencing of 481 diverse soybean accessions, comprising 52 wild (Glycine soja) selections and 429 cultivated (Glycine max) varieties (landraces and elites). These data were used to identify 7.8 million SNPs, to predict SNP effects relative to genic regions, and to characterize genetic structure, relationships among accessions, and linkage disequilibrium. We found evidence of distinct, mostly independent selection of lineages in particular geographic locations. Among cultivated varieties, we identified numerous highly conserved regions, suggesting selection during domestication. Comparison of these accessions against the entire U.S. germplasm collection genotyped with the SoySNP50K iSelect BeadChip revealed that over 95% of the re-sequenced accessions are highly similar to their SoySNP50K counterparts. Probable errors in seed source or genotype tracking were identified in the remaining approximately 5% of accessions.
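The comparison between re-sequenced accessions and their SoySNP50K counterparts can be illustrated with a simple concordance calculation. The sketch below is not the authors' pipeline: it assumes genotypes at shared SNP positions are coded as alternate-allele counts (0, 1, 2) with `None` for missing calls, and the function name `genotype_concordance` and the example vectors are hypothetical.

```python
from typing import Optional, Sequence


def genotype_concordance(
    a: Sequence[Optional[int]], b: Sequence[Optional[int]]
) -> Optional[float]:
    """Fraction of shared SNP positions whose calls agree.

    Positions where either call is missing (None) are skipped.
    Returns None when no position can be compared.
    """
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x is None or y is None:
            continue  # a missing call on either platform is not comparable
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        return None
    return matches / compared


# Hypothetical calls for one accession at eight shared SNP positions.
reseq_calls = [0, 2, 2, 1, None, 0, 2, 0]  # from re-sequencing
chip_calls = [0, 2, 2, 1, 2, 0, 0, 0]      # from the SNP array
score = genotype_concordance(reseq_calls, chip_calls)
```

An accession with a low score against its expected array genotype would be a candidate for the seed-source or tracking errors mentioned above; any cutoff for "low" would have to come from the data and is not specified here.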

Publication types

  • Dataset
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Crops, Agricultural / genetics
  • Fabaceae / genetics
  • Genome, Plant*
  • Genotype
  • Geography
  • Glycine max / genetics*
  • Linkage Disequilibrium
  • Polymorphism, Single Nucleotide*
  • Selection, Genetic