HiMAP2: Identifying phylogenetically informative genetic markers from diverse genomic resources

Mol Ecol Resour. 2023 Jul;23(5):1155-1167. doi: 10.1111/1755-0998.13762. Epub 2023 Feb 15.

Abstract

Multiplexed amplicon sequencing offers a cost-effective and rapid solution for phylogenomic studies that include a large number of individuals. Selecting informative genetic markers is a critical initial step in designing such multiplexed amplicon panels, but screening various genomic data and selecting markers that are informative for the question at hand can be laborious. Here, we present a flexible and user-friendly tool, HiMAP2, for identifying, visualizing and filtering phylogenetically informative loci from diverse genomic and transcriptomic resources. This bioinformatics pipeline includes orthology prediction, exon extraction and filtering of aligned exon sequences according to user-defined specifications. Additionally, HiMAP2 facilitates exploration of the final filtered exons by incorporating phylogenetic inference of individual exon trees with raxml-ng as well as the estimation of a species tree using astral. Finally, results of the marker selection can be visualized and refined with an interactive Bokeh application that can be used to generate publication-quality figures. Source code and user instructions for HiMAP2 are available at https://github.com/popphylotools/HiMAP_v2.

Keywords: comparative genomics; genome; phylogenetic informativeness; targeted amplicon design; transcriptome.

MeSH terms

  • Genetic Markers
  • Genome*
  • Genomics*
  • Humans
  • Phylogeny
  • Software

Substances

  • Genetic Markers