Prediction of cell-penetrating peptides with feature selection techniques

Biochem Biophys Res Commun. 2016 Aug 12;477(1):150-154. doi: 10.1016/j.bbrc.2016.06.035. Epub 2016 Jun 10.

Abstract

Cell-penetrating peptides are a group of peptides which can transport different types of cargo molecules such as drugs across plasma membrane and have been applied in the treatment of various diseases. Thus, the accurate prediction of cell-penetrating peptides with bioinformatics methods will accelerate the development of drug delivery systems. The study aims to develop a powerful model to accurately identify cell-penetrating peptides. At first, the peptides were translated into a set of vectors with the same dimension by using dipeptide compositions. Secondly, the Analysis of Variance-based technique was used to reduce the dimension of the vector and explore the optimized features. Finally, the support vector machine was utilized to discriminate cell-penetrating peptides from non-cell-penetrating peptides. The five-fold cross-validated results showed that our proposed method could achieve an overall prediction accuracy of 83.6%. Based on the proposed model, we constructed a free webserver called C2Pred (http://lin.uestc.edu.cn/server/C2Pred).

Keywords: Analysis of variance; Cell-penetrating peptides; Support vector machine; g-gap dipeptide composition.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell-Penetrating Peptides / chemistry*
  • Databases, Protein
  • Internet
  • Support Vector Machine

Substances

  • Cell-Penetrating Peptides