Reconstructing phylogenetic relationships based on repeat sequence similarities

Daniel Vitales; Sònia Garcia; Steven Dodsworth

doi:10.1016/j.ympev.2020.106766

Reconstructing phylogenetic relationships based on repeat sequence similarities

Mol Phylogenet Evol. 2020 Jun:147:106766. doi: 10.1016/j.ympev.2020.106766. Epub 2020 Feb 28.

Authors

Daniel Vitales¹, Sònia Garcia², Steven Dodsworth³

Affiliations

¹ Institut Botànic de Barcelona (IBB, CSIC-Ajuntament de Barcelona), Barcelona, Catalonia, Spain; Laboratori de Botànica (UB) - Unitat associada al CSIC, Facultat de Farmàcia i Ciències de l'Alimentació, Universitat de Barcelona, Av. Joan XXIII 27-31, 08028 Barcelona, Catalonia, Spain. Electronic address: daniel.vitales@ibb.csic.es.
² Institut Botànic de Barcelona (IBB, CSIC-Ajuntament de Barcelona), Barcelona, Catalonia, Spain.
³ School of Life Sciences, University of Bedfordshire, Luton, United Kingdom.

PMID: 32119996
DOI: 10.1016/j.ympev.2020.106766

Abstract

A recent phylogenetic method based on genome-wide abundance of different repeat types proved to be useful in reconstructing the evolutionary history of several plant and animal groups. Here, we demonstrate that an alternative information source from the repeatome can also be employed to infer phylogenetic relationships among taxa. Specifically, this novel approach makes use of the repeat sequence similarity matrices obtained from the comparative clustering analyses of RepeatExplorer 2, which are subsequently transformed to between-taxa distance matrices. These pairwise matrices are used to construct neighbour-joining trees for each of the top most-abundant clusters and they are finally summarized in a consensus network. This methodology was tested on three groups of angiosperms and one group of insects, resulting in congruent evolutionary hypotheses compared to more standard systematic analyses based on commonly used DNA markers. We propose that the combined application of these phylogenetic approaches based on repeat abundances and repeat sequence similarities could be helpful to understand mechanisms governing genome and repeatome evolution.

Keywords: Genomics; Graph-based clustering; High-throughput sequencing; Next-generation sequencing; Phylogenetics; repetitive DNA.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Animals
Databases, Genetic
Evolution, Molecular
Genetic Markers
Magnoliopsida / genetics
Phylogeny*
Repetitive Sequences, Nucleic Acid / genetics*
Sequence Homology, Nucleic Acid*
Species Specificity

Substances

Genetic Markers