RADseq as a valuable tool for plants with large genomes-A case study in cycads

Mol Ecol Resour. 2019 Nov;19(6):1610-1622. doi: 10.1111/1755-0998.13085. Epub 2019 Sep 30.

Abstract

Full genome sequencing of organisms with large and complex genomes is intractable and cost ineffective under most research budgets. Cycads (Cycadales) represent one of the oldest lineages of the extant seed plants and, partly due to their age, have incredibly large genomes up to ~60 Gbp. Restriction site-associated DNA sequencing (RADseq) offers an approach to find genome-wide informative markers and has proven to be effective with both model and nonmodel organisms. We tested the application of RADseq using ezRAD across all 10 genera of the Cycadales including an example data set of Cycas calcicola representing 72 samples from natural populations. Using previously available plastid and mitochondrial genomes as references, reads were mapped recovering plastid and mitochondrial genome regions and nuclear markers for all of the genera. De novo assembly generated up to 138,407 high-depth clusters and up to 1,705 phylogenetically informative loci for the genera, and 4,421 loci for the example assembly of C. calcicola. The number of loci recovered by de novo assembly was lower than previous RADseq studies, yet still sufficient for downstream analysis. However, the number of markers could be increased by relaxing our assembly parameters, especially for the C. calcicola data set. Our results demonstrate the successful application of RADseq across the Cycadales to generate a large number of markers for all genomic compartments, despite the large number of plastids present in a typical plant cell. Our modified protocol was adapted to be applied to cycads and other organisms with large genomes to yield many informative genome-wide markers.

Keywords: RADseq; cycads; illumina sequencing; large genomes.

MeSH terms

  • Cycas / genetics*
  • Genetic Markers / genetics
  • Genome, Mitochondrial / genetics
  • Genome, Plant / genetics*
  • Genomics / methods
  • Phylogeny
  • Sequence Analysis, DNA / methods*

Substances

  • Genetic Markers