Efficient computation of motif discovery on Intel Many Integrated Core (MIC) Architecture

BMC Bioinformatics. 2018 Aug 13;19(Suppl 9):282. doi: 10.1186/s12859-018-2276-1.

Abstract

Background: Novel sequence motifs detection is becoming increasingly essential in computational biology. However, the high computational cost greatly constrains the efficiency of most motif discovery algorithms.

Results: In this paper, we accelerate MEME algorithm targeted on Intel Many Integrated Core (MIC) Architecture and present a parallel implementation of MEME called MIC-MEME base on hybrid CPU/MIC computing framework. Our method focuses on parallelizing the starting point searching method and improving iteration updating strategy of the algorithm. MIC-MEME has achieved significant speedups of 26.6 for ZOOPS model and 30.2 for OOPS model on average for the overall runtime when benchmarked on the experimental platform with two Xeon Phi 3120 coprocessors.

Conclusions: Furthermore, MIC-MEME has been compared with state-of-arts methods and it shows good scalability with respect to dataset size and the number of MICs. Source code: https://github.com/hkwkevin28/MIC-MEME .

Keywords: MEME; MIC; Motif discovery; Offload mode.

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Computer Graphics
  • Databases, Genetic
  • Humans
  • Internet
  • Nucleotide Motifs*
  • Promoter Regions, Genetic*
  • Regulatory Elements, Transcriptional*
  • Software*
  • Transcription Factors / metabolism

Substances

  • Transcription Factors