A Type-2 fuzzy data fusion approach for building reliable weighted protein interaction networks with application in protein complex detection

Comput Biol Med. 2017 Sep 1:88:18-31. doi: 10.1016/j.compbiomed.2017.06.019. Epub 2017 Jun 23.

Abstract

Detecting the protein complexes is an important task in analyzing the protein interaction networks. Although many algorithms predict protein complexes in different ways, surveys on the interaction networks indicate that about 50% of detected interactions are false positives. Consequently, the accuracy of existing methods needs to be improved. In this paper we propose a novel algorithm to detect the protein complexes in 'noisy' protein interaction data. First, we integrate several biological data sources to determine the reliability of each interaction and determine more accurate weights for the interactions. A data fusion component is used for this step, based on the interval type-2 fuzzy voter that provides an efficient combination of the information sources. This fusion component detects the errors and diminishes their effect on the detection protein complexes. So in the first step, the reliability scores have been assigned for every interaction in the network. In the second step, we have proposed a general protein complex detection algorithm by exploiting and adopting the strong points of other algorithms and existing hypotheses regarding real complexes. Finally, the proposed method has been applied for the yeast interaction datasets for predicting the interactions. The results show that our framework has a better performance regarding precision and F-measure than the existing approaches.

Keywords: Biological similarity; Fuzzy voter; Interval Type-2 fuzzy sets; Protein complexes; Protein-protein interaction networks; Topological similarity.

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Databases, Protein*
  • Fuzzy Logic*
  • Gene Expression Profiling
  • Multiprotein Complexes / chemistry*
  • Multiprotein Complexes / metabolism*
  • Protein Interaction Mapping / methods*
  • Protein Interaction Maps
  • Semantics

Substances

  • Multiprotein Complexes