Korean soybean core collection: Genotypic and phenotypic diversity population structure and genome-wide association study

PLoS One. 2019 Oct 22;14(10):e0224074. doi: 10.1371/journal.pone.0224074. eCollection 2019.

Abstract

A core collection is a subset that represents genetic diversity of the total collection. Soybean (Glycine max (L.) Merr.) is one of major food and feed crops. It is the world's most cultivated annual herbaceous legume. Constructing a core collection for soybean could play a pivotal role in conserving and utilizing its genetic variability for research and breeding programs. To construct and evaluate a Korean soybean core collection, genotypic and phenotypic data as well as population structure, were analyzed. The Korean soybean core collection consisted of 430 accessions selected from 2,872 collections based on Affymetrix Axiom® 180k SoyaSNP array data. The core collection represented 99% of genotypic diversity of the total collection. Analysis of population structure clustered the core collection into five subpopulations. Accessions from South Korea and North Korea were distributed across five subpopulations. Analysis of molecular variance indicated that only 2.01% of genetic variation could be explained by geographic origins while 16.18% of genetic variation was accounted for by subpopulations. Genome-wide association study (GWAS) for days to flowering, flower color, pubescent color, and growth habit confirmed that the core collection had the same genetic diversity for tested traits as the total collection. The Korean soybean core collection was constructed based on genotypic information of the 180k SNP data. Size and phenotypic diversity of the core collection accounted for approximately 14.9% and 18.1% of the total collection, respectively. GWAS of core and total collections successfully confirmed loci associated with tested traits. Consequently, the present study showed that the Korean soybean core collection could provide fundamental and practical material and information for both soybean genetic research and breeding programs.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genome, Plant*
  • Genome-Wide Association Study / methods*
  • Genotype
  • Glycine max / classification*
  • Glycine max / genetics*
  • Humans
  • Phenotype
  • Plant Breeding*
  • Plant Proteins / genetics*
  • Polymorphism, Single Nucleotide*
  • Republic of Korea

Substances

  • Plant Proteins

Grants and funding

This work was supported by a grant from the Next-Generation BioGreen-21 Program (No. PJ01321302), Rural Development Administration, Republic of Korea to MSC. Author KK is employed by and receives salary from FarmHannong, Ltd. The specific roles of these authors are articulated in the ‘author contributions’ section. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.