A parallel and sensitive software tool for methylation analysis on multicore platforms

Bioinformatics. 2015 Oct 1;31(19):3130-8. doi: 10.1093/bioinformatics/btv357. Epub 2015 Jun 10.

Abstract

Motivation: DNA methylation analysis suffers from very long processing time, as the advent of Next-Generation Sequencers has shifted the bottleneck of genomic studies from the sequencers that obtain the DNA samples to the software that performs the analysis of these samples. The existing software for methylation analysis does not seem to scale efficiently neither with the size of the dataset nor with the length of the reads to be analyzed. As it is expected that the sequencers will provide longer and longer reads in the near future, efficient and scalable methylation software should be developed.

Results: We present a new software tool, called HPG-Methyl, which efficiently maps bisulphite sequencing reads on DNA, analyzing DNA methylation. The strategy used by this software consists of leveraging the speed of the Burrows-Wheeler Transform to map a large number of DNA fragments (reads) rapidly, as well as the accuracy of the Smith-Waterman algorithm, which is exclusively employed to deal with the most ambiguous and shortest reads. Experimental results on platforms with Intel multicore processors show that HPG-Methyl significantly outperforms in both execution time and sensitivity state-of-the-art software such as Bismark, BS-Seeker or BSMAP, particularly for long bisulphite reads.

Availability and implementation: Software in the form of C libraries and functions, together with instructions to compile and execute this software. Available by sftp to anonymous@clariano.uv.es (password 'anonymous').

Contact: juan.orduna@uv.es or jdopazo@cipf.es.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Base Sequence
  • DNA Methylation / genetics*
  • Databases, Genetic
  • Genome
  • Genomics / methods*
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Molecular Sequence Data
  • Mutation / genetics
  • Mutation Rate
  • Software*
  • Sulfites
  • Time Factors

Substances

  • Sulfites
  • hydrogen sulfite