Pinpointing Genomic Regions and Candidate Genes Associated with Seed Oil and Protein Content in Soybean through an Integrative Transcriptomic and QTL Meta-Analysis

Cells. 2022 Dec 26;12(1):97. doi: 10.3390/cells12010097.

Abstract

Soybean with enriched nutrients has emerged as a prominent source of edible oil and protein. In the present study, a meta-analysis was performed by integrating quantitative trait loci (QTLs) information, region-specific association and transcriptomic analysis. Analysis of about a thousand QTLs previously identified in soybean helped to pinpoint 14 meta-QTLs for oil and 16 meta-QTLs for protein content. Similarly, region-specific association analysis using whole genome re-sequenced data was performed for the most promising meta-QTL on chromosomes 6 and 20. Only 94 out of 468 genes related to fatty acid and protein metabolic pathways identified within the meta-QTL region were found to be expressed in seeds. Allele mining and haplotyping of these selected genes were performed using whole genome resequencing data. Interestingly, a significant haplotypic association of some genes with oil and protein content was observed, for instance, in the case of FAD2-1B gene, an average seed oil content of 20.22% for haplotype 1 compared to 15.52% for haplotype 5 was observed. In addition, the mutation S86F in the FAD2-1B gene produces a destabilizing effect of (ΔΔG Stability) -0.31 kcal/mol. Transcriptomic analysis revealed the tissue-specific expression of candidate genes. Based on their higher expression in seed developmental stages, genes such as sugar transporter, fatty acid desaturase (FAD), lipid transporter, major facilitator protein and amino acid transporter can be targeted for functional validation. The approach and information generated in the present study will be helpful in the map-based cloning of regulatory genes, as well as for marker-assisted breeding in soybean.

Keywords: haplotyping; meta-analysis; nutrition; quantitative trait loci; soybean; transcriptomics.

Publication types

  • Meta-Analysis
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Chromosome Mapping
  • Genomics
  • Glycine max* / chemistry
  • Plant Breeding
  • Plant Oils / metabolism
  • Quantitative Trait Loci* / genetics
  • Seeds / metabolism
  • Transcriptome / genetics

Substances

  • Plant Oils

Grants and funding

The authors are thankful to the Department of Biotechnology (DBT), Government of India (GoI) for the Ramalingaswami Fellowship Award to H.S. and R.D.; Grant BT/PR32853/AGIII/103/1159/2019 and Grant BT/PR38279/GET/119/351/2020 to H.S., R.D. and T.R.S.; the Science and Engineering Research Board (SERB), India, Department of Science and Technology (DST), Government of India (GoI), for the J.C. Bose Fellowship to T.R.S., and Research grant CRG/2019/006599 awarded to R.D., H.S. and T.R.S.; SPARC/2018-2019/P442/SL to H.S. and V.G., and the University Grants Commission (UGC) for providing JRF to V.K., S.K. and S.S.