Ens-PPI: A Novel Ensemble Classifier for Predicting the Interactions of Proteins Using Autocovariance Transformation from PSSM

Biomed Res Int. 2016:2016:4563524. doi: 10.1155/2016/4563524. Epub 2016 Jun 29.

Abstract

Protein-Protein Interactions (PPIs) play vital roles in most biological activities. Although the development of high-throughput biological technologies has generated considerable PPI data for various organisms, many problems are still far from being solved. A number of computational methods based on machine learning have been developed to facilitate the identification of novel PPIs. In this study, a novel predictor was designed using the Rotation Forest (RF) algorithm combined with Autocovariance (AC) features extracted from the Position-Specific Scoring Matrix (PSSM). More specifically, the PSSMs are generated using the information of protein amino acids sequence. Then, an effective sequence-based features representation, Autocovariance, is employed to extract features from PSSMs. Finally, the RF model is used as a classifier to distinguish between the interacting and noninteracting protein pairs. The proposed method achieves promising prediction performance when performed on the PPIs of Yeast, H. pylori, and independent datasets. The good results show that the proposed model is suitable for PPIs prediction and could also provide a useful supplementary tool for solving other bioinformatics problems.

MeSH terms

  • Algorithms
  • Amino Acid Sequence / genetics*
  • Computational Biology / methods*
  • Helicobacter pylori / genetics
  • Machine Learning
  • Protein Interaction Maps / genetics*
  • Proteins / genetics*
  • Saccharomyces cerevisiae / genetics

Substances

  • Proteins