An Unsupervised Classifier for Whole-Genome Phylogenies, the Maxwell© Tool

Int J Mol Sci. 2023 Nov 13;24(22):16278. doi: 10.3390/ijms242216278.

Abstract

The development of phylogenetic trees based on RNA or DNA sequences generally requires a precise and limited choice of important RNAs, e.g., messenger RNAs of essential proteins or ribosomal RNAs (like 16S), but rarely complete genomes, making it possible to explain evolution and speciation. In this article, we propose revisiting a classic phylogeny of archaea from only the information on the succession of nucleotides of their entire genome. For this purpose, we use a new tool, the unsupervised classifier Maxwell, whose principle lies in the Burrows-Wheeler compression transform, and we show its efficiency in clustering whole archaeal genomes.

Keywords: Burrows–Wheeler compression transform; Vitányi distance; maxwell classifier; normalized compression distance (NCD); phylogenetic trees; unsupervised classifier.

MeSH terms

  • Archaea* / genetics
  • Base Sequence
  • Genome*
  • Phylogeny
  • RNA, Ribosomal

Substances

  • RNA, Ribosomal

Grants and funding

This research received no external funding.