Effectiveness of Two Universal Angiosperm Probe Sets Tested In Silico for Caryophyllids Taxa with Emphasis on Cacti Species

Genes (Basel). 2022 Mar 24;13(4):570. doi: 10.3390/genes13040570.

Abstract

In angiosperms, huge advances in massive DNA sequencing technologies have impacted phylogenetic studies. Probe sets have been developed with the purpose of recovering hundreds of orthologous loci of targeted DNA sequences (TDS) across different plant lineages. We tested in silico the effectiveness of two universal probe sets in the whole available genomes of Caryophyllids, emphasizing phylogenetic issues in cacti species. A total of 870 TDS (517 TDS from Angiosperm v.1 and 353 from Angiosperms353) were individually tested in nine cacti species and Amaranthus hypochondriacus (external group) with ≥17 Gbp of available DNA data. The effectiveness was measured by the total number of orthologous loci recovered and their length, the percentage of loci discarded by paralogy, and the proportion of informative sites (PIS) in the alignments. The results showed that, on average, Angiosperms353 was better than Angiosperm v.1 for cacti species, since the former obtained an average of 275.6 loci that represent 123,687 bp, 2.48% of paralogous loci, and 4.32% of PIS in alignments, whereas the latter recovered 148.4 loci (37,683 bp), 10.38% of paralogous loci, and 3.49% of PIS. We recommend the use of predesigned universal probe sets for Caryophyllids, since these recover a high number of orthologous loci that resolve phylogenetic relationships.

Keywords: cactaceae; caryophyllids; phylogenomics; target enrichment; universal probe sets.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cactaceae* / genetics
  • Genome
  • Magnoliopsida* / genetics
  • Phylogeny
  • Sequence Analysis, DNA / methods