A brief review of software tools for pangenomics

Genomics Proteomics Bioinformatics. 2015 Feb;13(1):73-6. doi: 10.1016/j.gpb.2015.01.007. Epub 2015 Feb 23.

Abstract

Since the proposal for pangenomic study, there have been a dozen software tools actively in use for pangenomic analysis. By the end of 2014, Panseq and the pan-genomes analysis pipeline (PGAP) ranked as the top two most popular packages according to cumulative citations of peer-reviewed scientific publications. The functions of the software packages and tools, albeit variable among them, include categorizing orthologous genes, calculating pangenomic profiles, integrating gene annotations, and constructing phylogenies. As epigenomic elements are being gradually revealed in prokaryotes, it is expected that pangenomic databases and toolkits have to be extended to handle information of detailed functional annotations for genes and non-protein-coding sequences including non-coding RNAs, insertion elements, and conserved structural elements. To develop better bioinformatic tools, user feedback and integration of novel features are both of essence.

Keywords: Comparative analysis; Core genes; Genomic dynamics; Pangenome; Pangenomics.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Bacteria / genetics*
  • Computational Biology*
  • Databases, Factual
  • Genome*
  • Genomics / methods*
  • Humans
  • Molecular Sequence Annotation / methods*
  • Open Reading Frames / genetics
  • Phylogeny
  • Software*