Protein complex discovery by interaction filtering from protein interaction networks using mutual rank coexpression and sequence similarity

Biomed Res Int. 2015:2015:165186. doi: 10.1155/2015/165186. Epub 2015 Jan 27.

Abstract

The evaluation of the biological networks is considered the essential key to understanding the complex biological systems. Meanwhile, the graph clustering algorithms are mostly used in the protein-protein interaction (PPI) network analysis. The complexes introduced by the clustering algorithms include noise proteins. The error rate of the noise proteins in the PPI network researches is about 40-90%. However, only 30-40% of the existing interactions in the PPI databases depend on the specific biological function. It is essential to eliminate the noise proteins and the interactions from the complexes created via clustering methods. We have introduced new methods of weighting interactions in protein clusters and the splicing of noise interactions and proteins-based interactions on their weights. The coexpression and the sequence similarity of each pair of proteins are considered the edge weight of the proteins in the network. The results showed that the edge filtering based on the amount of coexpression acts similar to the node filtering via graph-based characteristics. Regarding the removal of the noise edges, the edge filtering has a significant advantage over the graph-based method. The edge filtering based on the amount of sequence similarity has the ability to remove the noise proteins and the noise interactions.

MeSH terms

  • Databases, Protein*
  • Gene Expression Regulation / physiology*
  • Gene Regulatory Networks / physiology*
  • Humans
  • Models, Genetic*
  • Proteins / genetics*
  • Proteins / metabolism*
  • Sequence Homology, Amino Acid

Substances

  • Proteins