Prediction of Signal Peptides in Proteins from Malaria Parasites

Int J Mol Sci. 2018 Nov 22;19(12):3709. doi: 10.3390/ijms19123709.

Abstract

Signal peptides are N-terminal presequences responsible for targeting proteins to the endomembrane system, and subsequent subcellular or extracellular compartments, and consequently condition their proper function. The significance of signal peptides stimulates development of new computational methods for their detection. These methods employ learning systems trained on datasets comprising signal peptides from different types of proteins and taxonomic groups. As a result, the accuracy of predictions are high in the case of signal peptides that are well-represented in databases, but might be low in other, atypical cases. Such atypical signal peptides are present in proteins found in apicomplexan parasites, causative agents of malaria and toxoplasmosis. Apicomplexan proteins have a unique amino acid composition due to their AT-biased genomes. Therefore, we designed a new, more flexible and universal probabilistic model for recognition of atypical eukaryotic signal peptides. Our approach called signalHsmm includes knowledge about the structure of signal peptides and physicochemical properties of amino acids. It is able to recognize signal peptides from the malaria parasites and related species more accurately than popular programs. Moreover, it is still universal enough to provide prediction of other signal peptides on par with the best preforming predictors.

Keywords: HSMM; apicomplexa; hidden semi-Markov model; malaria; plasmodium; signal peptides.

MeSH terms

  • Amino Acids / chemistry
  • Markov Chains
  • Plasmodium / chemistry*
  • Protein Sorting Signals*
  • Protozoan Proteins / chemistry*
  • Sequence Analysis, Protein / methods*
  • Sequence Analysis, Protein / standards

Substances

  • Amino Acids
  • Protein Sorting Signals
  • Protozoan Proteins