Exploring the sorghum race level diversity utilizing 272 sorghum accessions genomic resources

Front Plant Sci. 2023 Mar 17:14:1143512. doi: 10.3389/fpls.2023.1143512. eCollection 2023.

Abstract

Due to evolutionary divergence, sorghum race populations exhibit significant genetic and morphological variation. A k-mer-based sorghum race sequence comparison identified the conserved k-mers of all 272 accessions from sorghum and the race-specific genetic signatures identified the gene variability in 10,321 genes (PAVs). To understand sorghum race structure, diversity and domestication, a deep learning-based variant calling approach was employed in a set of genotypic data derived from a diverse panel of 272 sorghum accessions. The data resulted in 1.7 million high-quality genome-wide SNPs and identified selective signature (both positive and negative) regions through a genome-wide scan with different (iHS and XP-EHH) statistical methods. We discovered 2,370 genes associated with selection signatures including 179 selective sweep regions distributed over 10 chromosomes. Co-localization of these regions undergoing selective pressure with previously reported QTLs and genes revealed that the signatures of selection could be related to the domestication of important agronomic traits such as biomass and plant height. The developed k-mer signatures will be useful in the future to identify the sorghum race and for trait and SNP markers for assisting in plant breeding programs.

Keywords: deep learning; deep variant calling; gene enrichment; k-mer analysis; positive and negative selection; selection pressure; sorghum race.

Grants and funding

The authors also acknowledge the supporting funds from AVISA (OPP1198373) and ICAR-BMGF (101165). We also acknowledge the support from the Bill and Melinda Gates Foundation (BMGF – INV-037010).