Phylogeny of Crataegus (Rosaceae) based on 257 nuclear loci and chloroplast genomes: evaluating the impact of hybridization

PeerJ. 2021 Oct 26:9:e12418. doi: 10.7717/peerj.12418. eCollection 2021.

Abstract

Background: Hawthorn species (Crataegus L.; Rosaceae tribe Maleae) form a well-defined clade comprising five subgeneric groups readily distinguished using either molecular or morphological data. While multiple subsidiary groups (taxonomic sections, series) are recognized within some subgenera, the number of and relationships among species in these groups are subject to disagreement. Gametophytic apomixis and polyploidy are prevalent in the genus, and disagreement concerns whether and how apomictic genotypes should be recognized taxonomically. Recent studies suggest that many polyploids arise from hybridization between members of different infrageneric groups.

Methods: We used target capture and high throughput sequencing to obtain nucleotide sequences for 257 nuclear loci and nearly complete chloroplast genomes from a sample of hawthorns representing all five currently recognized subgenera. Our sample is structured to include two examples of intersubgeneric hybrids and their putative diploid and tetraploid parents. We queried the alignment of nuclear loci directly for evidence of hybridization, and compared individual gene trees with each other, and with both the maximum likelihood plastome tree and the nuclear concatenated and multilocus coalescent-based trees. Tree comparisons provided a promising, if challenging (because of the number of comparisons involved) method for visualizing variation in tree topology. We found it useful to deploy comparisons based not only on tree-tree distances but also on a metric of tree-tree concordance that uses extrinsic information about the relatedness of the terminals in comparing tree topologies.

Results: We obtained well-supported phylogenies from plastome sequences and from a minimum of 244 low copy-number nuclear loci. These are consistent with a previous morphology-based subgeneric classification of the genus. Despite the high heterogeneity of individual gene trees, we corroborate earlier evidence for the importance of hybridization in the evolution of Crataegus. Hybridization between subgenus Americanae and subgenus Sanguineae was documented for the origin of Sanguineae tetraploids, but not for a tetraploid Americanae species. This is also the first application of target capture probes designed with apple genome sequence. We successfully assembled 95% of 257 loci in Crataegus, indicating their potential utility across the genera of the apple tribe.

Keywords: Crataegus; Hyb-Seq; Hybridization; Nuclear loci; Phylogeny; Plastomes; Rosaceae; Target capture phylogenetics; Tree-tree comparisons.

Grants and funding

This research was made possible with the support of the Royal Ontario Museum (ROM) DMV Acquisition & Research Fund, 2013; Discovery Grant (A3430; re-applied for at 3–5 year intervals), Natural Sciences and Engineering Research Council of Canada, 1987–2015; Royal Ontario Museum departmental fieldwork funds (Natural History, Center for Biodiversity & Conservation Biology, Science, Science Cooperative Field Studies), 1988-2008; Research Grants, ROM/ROMCA Special Research Fund, 2014–2019; and Strategic Project Grant (381073), Natural Sciences and Engineering Research Council of Canada, 2010–2012, joint with Prof. S. Stefanović (University of Toronto – Mississauga), Paul Shipley (University of British Columbia – Okanagan), S. Proctor (University of Alberta) and the Naturally Grown Herb and Spice Growers Co-operative (HerbPro; Edgewood BC, J. Lee, President); Paula Brown (British Columbia Institute of Technology) materially assisted us in obtaining this grant. Student assistants in the ROM Green Plant Herbarium were co-funded by the University of Toronto Work Study program; all of the University of Toronto and NSERCC funding was facilitated by the Department of Ecology and Evolution (and its predecessor, the Department of Botany). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.