Species Identification of Oaks (Quercus L., Fagaceae) from Gene to Genome

Int J Mol Sci. 2019 Nov 26;20(23):5940. doi: 10.3390/ijms20235940.

Abstract

Species identification of oaks (Quercus) is always a challenge because many species exhibit variable phenotypes that overlap with other species. Oaks are notorious for interspecific hybridization and introgression, and complex speciation patterns involving incomplete lineage sorting. Therefore, accurately identifying Quercus species barcodes has been unsuccessful. In this study, we used chloroplast genome sequence data to identify molecular markers for oak species identification. Using next generation sequencing methods, we sequenced 14 chloroplast genomes of Quercus species in this study and added 10 additional chloroplast genome sequences from GenBank to develop a DNA barcode for oaks. Chloroplast genome sequence divergence was low. We identified four mutation hotspots as candidate Quercus DNA barcodes; two intergenic regions (matK-trnK-rps16 and trnR-atpA) were located in the large single copy region, and two coding regions (ndhF and ycf1b) were located in the small single copy region. The standard plant DNA barcode (rbcL and matK) had lower variability than that of the newly identified markers. Our data provide complete chloroplast genome sequences that improve the phylogenetic resolution and species level discrimination of Quercus. This study demonstrates that the complete chloroplast genome can substantially increase species discriminatory power and resolve phylogenetic relationships in plants.

Keywords: Quercus; chloroplast genome; mutation hotspots; oak species identification.

MeSH terms

  • Chloroplasts / genetics*
  • DNA Barcoding, Taxonomic / methods*
  • Evolution, Molecular
  • Genetic Markers
  • Genome, Chloroplast
  • High-Throughput Nucleotide Sequencing
  • Mutation
  • Phylogeny
  • Quercus / classification*
  • Quercus / genetics
  • Sequence Analysis, DNA

Substances

  • Genetic Markers