HiMAP: Robust phylogenomics from highly multiplexed amplicon sequencing

Mol Ecol Resour. 2018 Apr 6. doi: 10.1111/1755-0998.12783. Online ahead of print.

Abstract

High-throughput sequencing has fundamentally changed how molecular phylogenetic data sets are assembled, and phylogenomic data sets commonly contain 50- to 100-fold more loci than those generated using traditional Sanger sequencing-based approaches. Here, we demonstrate a new approach for building phylogenomic data sets using single-tube, highly multiplexed amplicon sequencing, which we name HiMAP (highly multiplexed amplicon-based phylogenomics), and present bioinformatic pipelines for locus selection based on genomic and transcriptomic data resources and for postsequencing consensus calling and alignment. This method is inexpensive, is amenable to sequencing a large number (hundreds) of taxa simultaneously, and requires minimal hands-on time at the bench (less than half a day); data analysis can be accomplished without read mapping or assembly. We demonstrate this approach by sequencing 878 amplicons in single reactions for 82 species of tephritid fruit flies across seven genera (384 individuals), including some of the most economically important agricultural insect pests. The resulting filtered data set (>150,000-bp concatenated alignment, ~20% missing character sites across all individuals and amplicons) contained >40,000 phylogenetically informative characters, and although some discordance was observed between analyses, it provided unparalleled resolution of many phylogenetic relationships in this group. Most notably, we found high support for the generic status of Zeugodacus and for the sister relationship between Dacus and Zeugodacus. We discuss HiMAP with regard to its molecular and bioinformatic strengths and the insight the resulting data set provides into relationships within this diverse insect group.

Keywords: Bactrocera; Tephritidae; high-throughput sequencing; phylogenetics; systematics.
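
The abstract summarizes the filtered alignment with two statistics: the proportion of missing character sites and the number of phylogenetically informative characters. The sketch below shows one way such figures could be computed from a concatenated alignment. It is not the HiMAP pipeline. It assumes that "informative" means parsimony-informative (at least two character states each present in at least two sequences), that "-", "?", and "N" all count as missing, and that the alignment is held as a plain dictionary of equal-length strings. The function names and the toy alignment are hypothetical.

```python
from collections import Counter

# Assumption: gaps, unknowns, and N are all treated as missing data.
MISSING = set("-?N")


def missing_fraction(alignment):
    """Fraction of all character cells in the alignment that are missing."""
    total = sum(len(seq) for seq in alignment.values())
    missing = sum(
        ch.upper() in MISSING for seq in alignment.values() for ch in seq
    )
    return missing / total


def is_parsimony_informative(column):
    """True if at least two states each occur in at least two sequences."""
    counts = Counter(ch.upper() for ch in column if ch.upper() not in MISSING)
    return sum(1 for n in counts.values() if n >= 2) >= 2


def count_informative_sites(alignment):
    """Count parsimony-informative columns in an equal-length alignment."""
    seqs = list(alignment.values())
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("all sequences must have the same aligned length")
    # zip(*seqs) walks the alignment column by column.
    return sum(is_parsimony_informative(col) for col in zip(*seqs))


# Hypothetical toy alignment: four taxa, eight aligned sites.
toy = {
    "taxonA": "ACGTAC-T",
    "taxonB": "ACGTACGT",
    "taxonC": "ATGAAC-T",
    "taxonD": "ATG?ACGA",
}

print(count_informative_sites(toy))  # 1 (only the second column qualifies)
print(missing_fraction(toy))         # 0.09375 (3 missing cells out of 32)
```

In the toy alignment, only the second column (C, C, T, T) is parsimony-informative. Columns in which the minority state appears once, such as the last one (T, T, T, A), are variable but do not count.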