A customised target capture sequencing tool for molecular identification of Aloe vera and relatives

Sci Rep. 2021 Dec 21;11(1):24347. doi: 10.1038/s41598-021-03300-0.

Abstract

Plant molecular identification studies have, until recently, been limited to the use of highly conserved markers from plastid and other organellar genomes, compromising resolution in highly diverse plant clades. Due to their higher evolutionary rates and reduced paralogy, low-copy nuclear genes overcome this limitation but are difficult to sequence with conventional methods and require high-quality input DNA. Aloe vera and its relatives in the Alooideae clade (Asphodelaceae, subfamily Asphodeloideae) are of economic interest for food and health products and have horticultural value. However, pressing conservation issues are increasing the need for a molecular identification tool to regulate the trade. With > 600 species and an origin of ± 15 million years ago, this predominantly African succulent plant clade is a diverse and taxonomically complex group for which low-copy nuclear genes would be desirable for accurate species discrimination. Unfortunately, with an average genome size of 16.76 pg, obtaining high coverage sequencing data for these genes would be prohibitively costly and computationally demanding. We used newly generated transcriptome data to design a customised RNA-bait panel targeting 189 low-copy nuclear genes in Alooideae. We demonstrate its efficacy in obtaining high-coverage sequence data for the target loci on Illumina sequencing platforms, including degraded DNA samples from museum specimens, with considerably improved phylogenetic resolution. This customised target capture sequencing protocol has the potential to confidently indicate phylogenetic relationships of Aloe vera and related species, as well as aid molecular identification applications.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aloe / classification*
  • Aloe / genetics*
  • Aloe / metabolism
  • Biological Evolution*
  • Cell Nucleus / genetics*
  • Genome, Plant
  • High-Throughput Nucleotide Sequencing
  • Phylogeny*
  • Plant Proteins / genetics
  • Plant Proteins / metabolism*
  • Transcriptome

Substances

  • Plant Proteins