Switchgrass genomic diversity, ploidy, and evolution: novel insights from a network-based SNP discovery protocol

PLoS Genet. 2013;9(1):e1003215. doi: 10.1371/journal.pgen.1003215. Epub 2013 Jan 17.

Abstract

Switchgrass (Panicum virgatum L.) is a perennial grass that has been designated as an herbaceous model biofuel crop for the United States of America. To facilitate accelerated breeding programs of switchgrass, we developed both an association panel and linkage populations for genome-wide association study (GWAS) and genomic selection (GS). All of the 840 individuals were then genotyped using genotyping by sequencing (GBS), generating 350 GB of sequence in total. As a highly heterozygous polyploid (tetraploid and octoploid) species lacking a reference genome, switchgrass is highly intractable with earlier methodologies of single nucleotide polymorphism (SNP) discovery. To access the genetic diversity of species like switchgrass, we developed a SNP discovery pipeline based on a network approach called the Universal Network-Enabled Analysis Kit (UNEAK). Complexities that hinder SNP discovery, such as repeats, paralogs, and sequencing errors, are easily resolved with UNEAK. Here, 1.2 million putative SNPs were discovered in a diverse collection of primarily upland, northern-adapted switchgrass populations. Further analysis of this data set revealed the fundamentally diploid nature of tetraploid switchgrass. Taking advantage of the high conservation of genome structure between switchgrass and foxtail millet (Setaria italica (L.) P. Beauv.), two parent-specific, synteny-based, ultra high-density linkage maps containing a total of 88,217 SNPs were constructed. Also, our results showed clear patterns of isolation-by-distance and isolation-by-ploidy in natural populations of switchgrass. Phylogenetic analysis supported a general south-to-north migration path of switchgrass. In addition, this analysis suggested that upland tetraploid arose from upland octoploid. Altogether, this study provides unparalleled insights into the diversity, genomic complexity, population structure, phylogeny, phylogeography, ploidy, and evolutionary dynamics of switchgrass.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Biofuels
  • Biological Evolution
  • Chromosome Mapping
  • Genetic Variation*
  • Genome, Plant
  • Genome-Wide Association Study*
  • Panicum / genetics*
  • Phylogeny
  • Phylogeography
  • Polymorphism, Single Nucleotide
  • Polyploidy*
  • Selection, Genetic
  • Sequence Analysis, DNA
  • Synteny

Substances

  • Biofuels

Grants and funding

This project was funded by the United States Department of Energy and United States Department of Agriculture Plant Feedstock Genomics for Bioenergy Program (Project no. DE-AI02-07ER64454), National Science Foundation awards 0820619 and 0965342, and the United States Department of Agriculture. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.