A method for identifying protein complexes with the features of joint co-localization and joint co-expression in static PPI networks

Comput Biol Med. 2019 Aug:111:103333. doi: 10.1016/j.compbiomed.2019.103333. Epub 2019 Jun 19.

Abstract

Identifying protein complexes in static protein-protein interaction (PPI) networks is essential for understanding the underlying mechanism of biological processes. Proteins in a complex are co-localized at the same place and co-expressed at the same time. We propose a novel method to identify protein complexes with the features of joint co-localization and joint co-expression in static PPI networks. To achieve this goal, we define a joint localization vector to construct a joint co-localization criterion of a protein group, and define a joint gene expression to construct a joint co-expression criterion of a gene group. Moreover, the functional similarity of proteins in a complex is an important characteristic. Thus, we use the CC-based, MF-based, and BP-based protein similarities to devise functional similarity criterion to determine whether a protein is functionally similar to a protein cluster. Based on the core-attachment structure and following to seed expanding strategy, we use four types of biological data including PPI data with reliability score, protein localization data, gene expression data, and gene ontology annotations, to identify protein complexes. The experimental results on yeast data show that comparing with existing methods our proposed method can efficiently and exactly identify more protein complexes, especially more protein complexes of sizes from 2 to 6. Furthermore, the enrichment analysis demonstrates that the protein complexes identified by our method have significant biological meaning.

Keywords: Core-attachment structure; Joint co-expression; Joint co-localization; Protein complexes; Seed expanding strategy; Static PPI networks.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology
  • Molecular Sequence Annotation
  • Protein Interaction Mapping / methods*
  • Protein Interaction Maps* / genetics
  • Protein Interaction Maps* / physiology
  • Proteins* / genetics
  • Proteins* / metabolism
  • Transcriptome* / genetics
  • Transcriptome* / physiology

Substances

  • Proteins

Associated data

  • figshare/10.6084/m9.figshare.7719296