Identifying Wild Versus Cultivated Gene-Alleles Conferring Seed Coat Color and Days to Flowering in Soybean

Int J Mol Sci. 2021 Feb 4;22(4):1559. doi: 10.3390/ijms22041559.

Abstract

Annual wild soybean (G. soja) is the ancestor of the cultivated soybean (G. max). To reveal the genetic changes from soja to max, an improved wild soybean chromosome segment substitution line (CSSL) population, SojaCSSLP5, composed of 177 CSSLs with 182 SSR markers (SSR-map), was developed based on SojaCSSLP1, which was generated from NN1138-2 (max) × N24852 (soja). SojaCSSLP5 was further genotyped through whole-genome resequencing, resulting in a physical map with 1366 SNPLDBs (SNP linkage-disequilibrium blocks). Compared with the SSR-map, the SNPLDB-map had more markers/segments, shorter marker lengths and more recombination breakpoints, and it yielded 721 new wild substituted segments. Using the SNPLDB-map, two loci co-segregating with seed-coat color (SCC) and six loci for days to flowering (DTF), with a phenotypic contribution of 88.02%, were identified. By integrating parental RNA-seq and DNA-resequencing data, two SCC and six DTF candidate genes, including three previously cloned genes (G, E2 and GmPRR3B) and five newly detected ones, were predicted and verified at the nucleotide-mutation level, and then supported by the consistency between gene-alleles and their phenotypes in SojaCSSLP5. For six of the eight genes, the parental allele-pairs coincided with those found in 303 germplasm accessions, and these genes were further supported by the consistency between gene-alleles and germplasm phenotypes. Accordingly, the CSSL population integrated with parental DNA and RNA sequencing data was demonstrated to be an efficient platform for identifying candidate wild vs. cultivated gene-alleles.
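As an illustration of the two mapping ideas mentioned above, the following minimal Python sketch shows (a) a co-segregation check for a qualitative trait such as SCC and (b) one simple way to express the phenotypic contribution of a single locus for a quantitative trait such as DTF. It is not the authors' pipeline: the abstract does not state how co-segregation or phenotypic contribution was computed, so the function names, the 0/1 coding (0 = cultivated allele, 1 = wild substituted segment), the between-group sum-of-squares ratio and all values below are assumptions made for illustration only.

```python
from typing import Dict, List, Sequence


def cosegregating_markers(genotypes: Dict[str, Sequence[int]],
                          phenotype: Sequence[int]) -> List[str]:
    """Return markers whose allele calls match the trait class in every line.

    Hypothetical coding: 0 = cultivated allele/phenotype, 1 = wild.
    """
    return [
        marker
        for marker, calls in genotypes.items()
        if len(calls) == len(phenotype)
        and all(g == p for g, p in zip(calls, phenotype))
    ]


def phenotypic_contribution(calls: Sequence[int],
                            values: Sequence[float]) -> float:
    """Share of phenotypic variance explained by one marker.

    Computed as between-allele-group sum of squares / total sum of squares
    (an assumed, simplified measure; not necessarily the paper's method).
    """
    n = len(values)
    mean = sum(values) / n
    ss_total = sum((v - mean) ** 2 for v in values)
    if ss_total == 0:
        return 0.0
    ss_between = 0.0
    for allele in set(calls):
        group = [v for c, v in zip(calls, values) if c == allele]
        group_mean = sum(group) / len(group)
        ss_between += len(group) * (group_mean - mean) ** 2
    return ss_between / ss_total


# Made-up toy data for four lines; block names are hypothetical.
toy_genotypes = {"blockA": [0, 1, 1, 0], "blockB": [0, 1, 0, 0]}
toy_seed_coat = [0, 1, 1, 0]          # 1 = wild-type seed coat color
toy_flowering = [40.0, 42.0, 50.0, 52.0]  # days to flowering

matching = cosegregating_markers(toy_genotypes, toy_seed_coat)
share = phenotypic_contribution([0, 0, 1, 1], toy_flowering)
```

In this toy case only `blockA` matches the seed-coat classes in every line, and the single marker explains most of the variation in the made-up flowering values. A real analysis would work with the 1366 SNPLDBs across 177 CSSLs and would need to account for multiple loci jointly.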

Keywords: SNP linkage disequilibrium block (SNPLDB); annual wild soybean (G. soja Sieb. and Zucc.); chromosome segment substitution line (CSSL); cultivated soybean (G. max (L.) Merr.); days to flowering; seed coat color; whole-genome re-sequencing.

MeSH terms

  • Alleles*
  • Chromosome Mapping
  • Computational Biology / methods
  • Flowers / genetics*
  • Genes, Plant*
  • Genetic Association Studies
  • Genetic Loci
  • Genome, Plant
  • Genotype
  • Glycine max / genetics*
  • Linkage Disequilibrium
  • Microsatellite Repeats
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Quantitative Trait, Heritable*
  • Seeds*
  • Whole Genome Sequencing