CUDA ClustalW: An efficient parallel algorithm for progressive multiple sequence alignment on Multi-GPUs

Comput Biol Chem. 2015 Oct:58:62-8. doi: 10.1016/j.compbiolchem.2015.05.004. Epub 2015 May 21.

Abstract

For biological applications, sequence alignment is an important strategy to analyze DNA and protein sequences. Multiple sequence alignment is an essential methodology to study biological data, such as homology modeling, phylogenetic reconstruction and etc. However, multiple sequence alignment is a NP-hard problem. In the past decades, progressive approach has been proposed to successfully align multiple sequences by adopting iterative pairwise alignments. Due to rapid growth of the next generation sequencing technologies, a large number of sequences can be produced in a short period of time. When the problem instance is large, progressive alignment will be time consuming. Parallel computing is a suitable solution for such applications, and GPU is one of the important architectures for contemporary parallel computing researches. Therefore, we proposed a GPU version of ClustalW v2.0.11, called CUDA ClustalW v1.0, in this work. From the experiment results, it can be seen that the CUDA ClustalW v1.0 can achieve more than 33× speedups for overall execution time by comparing to ClustalW v2.0.11.

Keywords: CUDA; ClustalW; GPU; Parallel computing; Progressive multiple sequence alignment.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Sequence Alignment*
  • Sequence Analysis, Protein
  • Software*