treeman: an R package for efficient and intuitive manipulation of phylogenetic trees

BMC Res Notes. 2017 Jan 7;10(1):30. doi: 10.1186/s13104-016-2340-8.

Abstract

Background: Phylogenetic trees are hierarchical structures used for representing the inter-relationships between biological entities. They are the most common tool for representing evolution and are essential to a range of fields across the life sciences. The manipulation of phylogenetic trees-in terms of adding or removing tips-is often performed by researchers not just for reasons of management but also for performing simulations in order to understand the processes of evolution. Despite this, the most common programming language among biologists, R, has few class structures well suited to these tasks.

Results: We present an R package that contains a new class, called TreeMan, for representing the phylogenetic tree. This class has a list structure allowing phylogenetic trees to be manipulated more efficiently. Computational running times are reduced because of the ready ability to vectorise and parallelise methods. Development is also improved due to fewer lines of code being required for performing manipulation processes.

Conclusions: We present three use cases-pinning missing taxa to a supertree, simulating evolution with a tree-growth model and detecting significant phylogenetic turnover-that demonstrate the new package's speed and simplicity.

Keywords: Evolution; Phylogenetic trees; R; Statistical computing; Tree simulation.

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Computer Simulation
  • Ecology
  • Models, Statistical
  • Phylogeny*
  • Software*