SecretP: a new method for predicting mammalian secreted proteins

Peptides. 2010 Apr;31(4):574-8. doi: 10.1016/j.peptides.2009.12.026. Epub 2010 Jan 4.

Abstract

In contrast to a large number of classically secreted proteins (CSPs) and non-secreted proteins (NSPs), only a few proteins have been experimentally proved to enter non-classical secretory pathways. So it is difficult to identify non-classically secreted proteins (NCSPs), and no methods are available for distinguishing the three types of proteins simultaneously. In order to solve this problem, a data mining has been taken firstly, and mammalian proteins exported via ER-Golgi-independent pathways are collected through extensive literature searches. In this paper, a support vector machine (SVM)-based ternary classifier named SecretP is proposed to predict mammalian secreted proteins by using pseudo-amino acid composition (PseAA) and five additional features. When distinguishing the three types of proteins, SecretP yielded an accuracy of 88.79%. Evaluating the performance of our method by an independent test set of 92 human proteins, 76 of them are correctly predicted as NCSPs. When performed on another public independent data set, the prediction result of SecretP is comparable to those of other existing computational methods. Therefore, SecretP can be a useful supplementary tool for future secretome studies. The web server SecretP and all supplementary tables listed in this paper are freely available at http://cic.scu.edu.cn/bioinformatics/secretp/index.htm.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Amino Acids / chemistry*
  • Computational Biology / methods
  • Data Mining / methods*
  • Databases, Protein*
  • Humans
  • Molecular Sequence Data
  • Protein Sorting Signals / genetics
  • Proteins* / chemistry
  • Proteins* / genetics
  • Proteins* / metabolism
  • Proteomics / methods
  • Sequence Analysis, Protein / methods*
  • Sequence Homology, Amino Acid

Substances

  • Amino Acids
  • Protein Sorting Signals
  • Proteins