Tedna: a transposable element de novo assembler

Matthias Zytnicki; Eduard Akhunov; Hadi Quesneville

doi:10.1093/bioinformatics/btu365

Bioinformatics. 2014 Sep 15;30(18):2656-8. doi: 10.1093/bioinformatics/btu365. Epub 2014 Jun 3.

Authors

Matthias Zytnicki¹, Eduard Akhunov¹, Hadi Quesneville¹

Affiliation

¹ INRA, URGI, Plant Breeding and Biology, Versailles 78026, France and Department of Plant Pathology, Kansas State University, Manhattan, KS 66506, USA.

PMID: 24894500
DOI: 10.1093/bioinformatics/btu365

Abstract

Motivation: Recent technological advances are allowing many laboratories to sequence their research organisms. Available de novo assemblers leave repetitive portions of the genome poorly assembled. Some genomes contain high proportions of transposable elements, and transposable elements appear to be a major force behind diversity and adaptation. Few de novo assemblers for transposable elements exist, and most have been designed either for small genomes or for 454 reads.

Results: In this article, we present a new transposable element de novo assembler, Tedna, which assembles a set of transposable elements directly from the reads. Tedna uses Illumina paired-end reads, the most widely used sequencing technology for de novo assembly, and forms full-length transposable elements.

Availability and implementation: Tedna is available at http://urgi.versailles.inra.fr/Tools/Tedna, under the GPLv3 license. It is written in C++11 and only requires the Sparsehash Package, freely available under the New BSD License. Tedna can be used on standard computers with limited RAM resources, although it may also use large memory for better results. Most of the code is parallelized and thus ready for large infrastructures.

MeSH terms

Arabidopsis / genetics
DNA Transposable Elements / genetics*
Genomics / methods*
Repetitive Sequences, Nucleic Acid
Sequence Analysis, DNA / methods*
Triticum / genetics

Substances

DNA Transposable Elements