Genome-wide computational approach for the prediction of duplications generating protein localization signals

Comput Biol Med. 2012 Nov;42(11):1091-7. doi: 10.1016/j.compbiomed.2012.09.001. Epub 2012 Sep 24.

Abstract

Investigating the possible generation of motifs accountable for aberrant protein dislocation subsequent to the rise of short tandem duplications is interesting, given the pathogenic potential of this mechanism, as demonstrated in diseases such adult myeloid leukemia (AML). In this paper we introduce a new computational method for predicting genomic points which, after hypothetical mutation events such as micro-duplications, might encode molecular patterns such as localization or export signals. The proposed framework allows to study motifs of unconstrained length defined as regular expressions at a genome-wide level, providing an in silico platform capable of analyzing the potential effect of duplications on abnormal cellular localization.

MeSH terms

  • Algorithms
  • Computer Simulation
  • Gene Duplication*
  • Genomics / methods*
  • Humans
  • Models, Genetic
  • Mutation
  • Pattern Recognition, Automated
  • Protein Sorting Signals / genetics*
  • Proteins / genetics
  • Proteins / metabolism
  • Tandem Repeat Sequences*

Substances

  • Protein Sorting Signals
  • Proteins