A Long Fragment Aligner called ALFALFA

Michaël Vyverman; Bernard De Baets; Veerle Fack; Peter Dawyndt

doi:10.1186/s12859-015-0533-0

A Long Fragment Aligner called ALFALFA

BMC Bioinformatics. 2015 May 15;16(1):159. doi: 10.1186/s12859-015-0533-0.

Authors

Michaël Vyverman¹, Bernard De Baets², Veerle Fack³, Peter Dawyndt⁴

Affiliations

¹ Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Krijgslaan 281 Building S9, Ghent, B-9000, Belgium. Michael.Vyverman@UGent.be.
² Department of Mathematical Modelling, Statistics and Bioinformatics, Ghent University, Coupure links 653, Ghent, B-9000, Belgium. Bernard.DeBaets@UGent.be.
³ Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Krijgslaan 281 Building S9, Ghent, B-9000, Belgium. Veerle.Fack@UGent.be.
⁴ Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Krijgslaan 281 Building S9, Ghent, B-9000, Belgium. Peter.Dawyndt@UGent.be.

Abstract

Background: Rapid evolutions in sequencing technology force read mappers into flexible adaptation to longer reads, changing error models, memory barriers and novel applications.

Results: ALFALFA achieves a high performance in accurately mapping long single-end and paired-end reads to gigabase-scale reference genomes, while remaining competitive for mapping shorter reads. Its seed-and-extend workflow is underpinned by fast retrieval of super-maximal exact matches from an enhanced sparse suffix array, with flexible parameter tuning to balance performance, memory footprint and accuracy.

Conclusions: ALFALFA is open source and available at http://alfalfa.ugent.be .

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Biological Evolution
Genome, Human*
High-Throughput Nucleotide Sequencing / methods*
Humans
Sequence Alignment / methods*
Sequence Analysis, DNA / methods*
Software*
Workflow