Improved short adjacent repeat identification using three evolutionary Monte Carlo schemes

Int J Data Min Bioinform. 2013;8(4):462-79. doi: 10.1504/ijdmb.2013.056614.

Abstract

This paper applies three Evolutionary Monte Carlo (EMC) schemes to the Short Adjacent Repeat Identification Problem (SARIP), which aims to identify the common repeat units shared by multiple sequences. The three EMC schemes, Random Exchange (RE), Best Exchange (BE), and crossover, are implemented on a parallel platform. The simulation results show that, compared with the conventional Markov Chain Monte Carlo (MCMC) algorithm, all three EMC schemes not only shorten the computation time by speeding up convergence but also improve the solution quality in difficult cases. Moreover, we observe that the performance of the different EMC schemes depends on the degree of degeneracy of the motif pattern.
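
To make the exchange idea concrete, the following is a minimal sketch of a random exchange move between two adjacent chains in a temperature ladder. It assumes the standard parallel-tempering acceptance rule with target density proportional to exp(-E/T) on each chain. The abstract does not give the paper's exact RE, BE, or crossover procedures, so the function names, the energy values, and the temperature ladder here are hypothetical and serve only as an illustration.

```python
import math
import random


def swap_log_ratio(energy_i, energy_j, temp_i, temp_j):
    """Log acceptance ratio for swapping the states of chains i and j.

    Assumes each chain k targets a density proportional to
    exp(-E(x) / T_k); this is the usual parallel-tempering rule,
    not necessarily the exact rule used in the paper.
    """
    return (energy_i - energy_j) * (1.0 / temp_i - 1.0 / temp_j)


def random_exchange(states, energies, temps, rng):
    """Propose one swap between a randomly chosen pair of adjacent chains.

    Mutates `states` and `energies` in place and returns True if the
    swap was accepted. All names here are illustrative assumptions.
    """
    i = rng.randrange(len(states) - 1)  # pick an adjacent pair (i, i + 1)
    j = i + 1
    log_r = swap_log_ratio(energies[i], energies[j], temps[i], temps[j])
    # Metropolis acceptance: always accept if the ratio is >= 1,
    # otherwise accept with probability exp(log_r).
    if log_r >= 0 or rng.random() < math.exp(log_r):
        states[i], states[j] = states[j], states[i]
        energies[i], energies[j] = energies[j], energies[i]
        return True
    return False


if __name__ == "__main__":
    rng = random.Random(0)
    # Two chains: the cold chain (T = 1) holds the higher-energy state.
    states = ["candidate_a", "candidate_b"]
    energies = [2.0, 1.0]
    temps = [1.0, 2.0]
    accepted = random_exchange(states, energies, temps, rng)
    print(accepted, states, energies)
```

In this sketch a swap that moves the lower-energy state to the colder chain is always accepted, which is the mechanism by which exchange moves help a sampler escape local modes. A best-exchange or crossover move would replace the pair selection or the proposal step; those variants are not shown because the abstract does not define them.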

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Markov Chains
  • Microsatellite Repeats*
  • Monte Carlo Method*
  • Pattern Recognition, Automated*