Reconstructing phylogenetic relationships based on repeat sequence similarities

Mol Phylogenet Evol. 2020 Jun:147:106766. doi: 10.1016/j.ympev.2020.106766. Epub 2020 Feb 28.

Abstract

A recent phylogenetic method based on the genome-wide abundance of different repeat types has proved useful in reconstructing the evolutionary history of several plant and animal groups. Here, we demonstrate that an alternative information source from the repeatome can also be employed to infer phylogenetic relationships among taxa. Specifically, this novel approach makes use of the repeat sequence similarity matrices obtained from the comparative clustering analyses of RepeatExplorer 2, which are subsequently transformed to between-taxa distance matrices. These pairwise matrices are used to construct a neighbour-joining tree for each of the most abundant clusters, and the resulting trees are finally summarized in a consensus network. This methodology was tested on three groups of angiosperms and one group of insects, yielding evolutionary hypotheses congruent with those from more standard systematic analyses based on commonly used DNA markers. We propose that the combined application of these phylogenetic approaches based on repeat abundances and repeat sequence similarities could be helpful to understand mechanisms governing genome and repeatome evolution.
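
The per-cluster part of the workflow described above (similarity matrix, then distance matrix, then neighbour-joining tree) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not state the transformation formula, so the rule used here (distance = 1 - similarity / largest off-diagonal similarity) is an assumption, as are the function names `similarity_to_distance` and `neighbour_joining`, the taxon labels, and the toy similarity values. The final step, summarizing the per-cluster trees in a consensus network, is not shown.

```python
from itertools import combinations


def similarity_to_distance(sim):
    """Turn a square between-taxa similarity matrix into a distance matrix.

    ASSUMPTION: distance = 1 - similarity / (largest off-diagonal similarity).
    The matrix is symmetrised first by averaging sim[i][j] and sim[j][i].
    """
    n = len(sim)
    sym = [[(sim[i][j] + sim[j][i]) / 2 for j in range(n)] for i in range(n)]
    top = max(sym[i][j] for i in range(n) for j in range(n) if i != j)
    return [[0.0 if i == j else 1.0 - sym[i][j] / top for j in range(n)]
            for i in range(n)]


def neighbour_joining(names, dist):
    """Return an unrooted neighbour-joining topology as a Newick string.

    Branch lengths are omitted to keep the sketch short. Requires >= 3 taxa.
    """
    nodes = list(names)
    # Distances are stored in a dict of dicts keyed by node label.
    d = {a: {b: dist[i][j] for j, b in enumerate(names)}
         for i, a in enumerate(names)}
    while len(nodes) > 3:
        n = len(nodes)
        # Sum of distances from each node to all current nodes.
        r = {a: sum(d[a][b] for b in nodes) for a in nodes}
        # Join the pair that minimises the standard Q criterion.
        a, b = min(combinations(nodes, 2),
                   key=lambda p: (n - 2) * d[p[0]][p[1]] - r[p[0]] - r[p[1]])
        new = f"({a},{b})"
        d[new] = {new: 0.0}
        for c in nodes:
            if c not in (a, b):
                # Distance from the new internal node to each remaining node.
                d[new][c] = d[c][new] = 0.5 * (d[a][c] + d[b][c] - d[a][b])
        nodes = [c for c in nodes if c not in (a, b)] + [new]
    # The last three nodes are joined at a central trifurcation.
    return "(" + ",".join(nodes) + ");"


# Hypothetical similarity values for one repeat cluster and four taxa.
taxa = ["A", "B", "C", "D"]
similarity = [
    [10, 9, 2, 2],
    [9, 10, 2, 2],
    [2, 2, 10, 8],
    [2, 2, 8, 10],
]
distance = similarity_to_distance(similarity)
tree = neighbour_joining(taxa, distance)
print(tree)
```

In a full analysis, one such tree would be built for each of the most abundant clusters, and the set of trees would then be passed to a separate tool that computes the consensus network.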

Keywords: Genomics; Graph-based clustering; High-throughput sequencing; Next-generation sequencing; Phylogenetics; Repetitive DNA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Databases, Genetic
  • Evolution, Molecular
  • Genetic Markers
  • Magnoliopsida / genetics
  • Phylogeny*
  • Repetitive Sequences, Nucleic Acid / genetics*
  • Sequence Homology, Nucleic Acid*
  • Species Specificity

Substances

  • Genetic Markers