An effective approach to detecting both small and large complexes from protein-protein interaction networks

BMC Bioinformatics. 2017 Oct 16;18(Suppl 12):419. doi: 10.1186/s12859-017-1820-8.

Abstract

Background: Predicting protein complexes from protein-protein interaction (PPI) networks has been studied for decade. Various methods have been proposed to address some challenging issues of this problem, including overlapping clusters, high false positive/negative rates of PPI data and diverse complex structures. It is well known that most current methods can detect effectively only complexes of size ≥3, which account for only about half of the total existing complexes. Recently, a method was proposed specifically for finding small complexes (size = 2 and 3) from PPI networks. However, up to now there is no effective approach that can predict both small (size ≤ 3) and large (size >3) complexes from PPI networks.

Results: In this paper, we propose a novel method, called CPredictor2.0, that can detect both small and large complexes under a unified framework. Concretely, we first group proteins of similar functions. Then, the Markov clustering algorithm is employed to discover clusters in each group. Finally, we merge all discovered clusters that overlap with each other to a certain degree, and the merged clusters as well as the remaining clusters constitute the set of detected complexes. Extensive experiments have shown that the new method can more effectively predict both small and large complexes, in comparison with the state-of-the-art methods.

Conclusions: The proposed method, CPredictor2.0, can be applied to accurately predict both small and large protein complexes.

Keywords: Large protein complex; Protein complex prediction; Protein-protein interaction; Small protein complex.

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Databases, Protein
  • Multiprotein Complexes / metabolism*
  • Protein Interaction Mapping / methods*
  • Protein Interaction Maps*
  • Saccharomyces cerevisiae / metabolism*
  • Saccharomyces cerevisiae Proteins / metabolism*

Substances

  • Multiprotein Complexes
  • Saccharomyces cerevisiae Proteins