Inference of transposable element ancestry

Aaron C Wacholder; Corey Cox; Thomas J Meyer; Robert P Ruggiero; Vijetha Vemulapalli; Annette Damert; Lucia Carbone; David D Pollock

doi:10.1371/journal.pgen.1004482

PLoS Genet. 2014 Aug 14;10(8):e1004482. doi: 10.1371/journal.pgen.1004482. eCollection 2014 Aug.

Authors

Aaron C Wacholder¹, Corey Cox¹, Thomas J Meyer², Robert P Ruggiero¹, Vijetha Vemulapalli¹, Annette Damert³, Lucia Carbone², David D Pollock¹

Affiliations

¹ Department of Biochemistry & Molecular Genetics, University of Colorado School of Medicine, Aurora, Colorado, United States of America.
² Department of Behavioural Neuroscience, Oregon Health Sciences University, Portland, Oregon, United States of America; Division of Neuroscience, Oregon National Primate Research Center, Beaverton, Oregon, United States of America.
³ Molecular Biology Centre, Institute for Interdisciplinary Research in Bio-Nano Sciences, Babes-Bolyai-University, Cluj-Napoca, Romania.

Abstract

The most common methods for inferring transposable element (TE) evolutionary relationships divide TEs into subfamilies using shared diagnostic nucleotides. Although these methods were originally justified by the "master gene" model of TE evolution, computational and experimental work indicates that many of the subfamilies they generate contain multiple source elements. This implies that subfamily-based methods give an incomplete picture of TE relationships. Studies of selection, functional exaptation, and predictions of horizontal transfer may all be affected. Here, we develop a Bayesian method for inferring TE ancestry that gives the probability that each sequence was replicative, its frequency of replication, and the probability that each extant TE sequence came from each possible ancestral sequence. Applying our method to 986 members of the newly discovered LAVA family of TEs, we show that there were far more source elements in the history of LAVA expansion than subfamilies identified using the CoSeg subfamily-classification program. We also identify multiple replicative elements in the AluSc subfamily in humans. Our results strongly indicate that a reassessment of subfamily structures is necessary to obtain accurate estimates of mutation processes, phylogenetic relationships and historical times of activity.
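
As a rough illustration of what "the probability that each extant TE sequence came from each possible ancestral sequence" can mean, the following is a minimal toy sketch in Python. It is not the model described in the paper. It assumes pre-aligned sequences of equal length, a single per-site mutation probability `mu` with equal substitution to any of the three other bases, and strictly positive priors over candidate ancestors. The function name `ancestor_posterior` and all parameter values are hypothetical.

```python
import math


def ancestor_posterior(extant, candidates, mu=0.05, priors=None):
    """Toy posterior that `extant` was copied from each candidate ancestor.

    Assumptions (illustrative only, not the published method):
    - sequences are aligned and of equal length;
    - each site mutates independently with probability `mu`,
      to each of the 3 other bases with equal probability;
    - priors are strictly positive (uniform if not given).
    """
    if priors is None:
        priors = [1.0 / len(candidates)] * len(candidates)

    log_weights = []
    for cand, prior in zip(candidates, priors):
        if len(cand) != len(extant):
            raise ValueError("sequences must be aligned to the same length")
        mismatches = sum(a != b for a, b in zip(cand, extant))
        matches = len(extant) - mismatches
        # Log-likelihood of the observed differences under the toy model.
        log_lik = mismatches * math.log(mu / 3) + matches * math.log(1 - mu)
        log_weights.append(math.log(prior) + log_lik)

    # Normalise in log space to avoid underflow on long sequences.
    top = max(log_weights)
    weights = [math.exp(w - top) for w in log_weights]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical example: two candidate source elements, one extant copy.
candidates = ["ACGTACGT", "ACGTTCGT"]
extant = "ACGTTCGA"
posterior = ancestor_posterior(extant, candidates)
```

In this example the second candidate differs from the extant copy at one site and the first at two, so the second receives most of the posterior mass. A full treatment along the lines of the abstract would additionally need to infer which sequences were replicative and how often they replicated, which this sketch does not attempt.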

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Bayes Theorem
DNA Transposable Elements / genetics*
Evolution, Molecular*
Gene Transfer, Horizontal / genetics
Humans
Mutation
Phylogeny*

Substances

DNA Transposable Elements
