Multiple genome alignment based on longest path in directed acyclic graphs

Int J Bioinform Res Appl. 2010;6(4):366-83. doi: 10.1504/IJBRA.2010.036.

Abstract

In this paper, we present a simple and efficient algorithm for multiple genome sequence alignment. Sequences of Maximal Unique Matches (MUMs) are first transformed into a multi-bipartite diagram. The diagram is then converted into a Directed Acyclic Graph (DAG). Therefore, finding the alignment is reduced to finding the longest path in the DAG, which is solvable in linear time. The experiments show that the algorithm can correctly find the alignment, and runs faster than MGA and EMAGEN. In addition, our algorithm can handle the alignments with overlapping MUMs and has both weighted and unweighted options. It provides the flexibility for the alignments depending on different needs.

MeSH terms

  • Algorithms*
  • Base Sequence
  • Genome*
  • Genomics / methods*
  • Sequence Alignment / methods*
  • Sequence Analysis, DNA