A probabilistic graph-theoretic approach to integrate multiple predictions for the protein-protein subnetwork prediction challenge

Ann N Y Acad Sci. 2009 Mar:1158:224-33. doi: 10.1111/j.1749-6632.2008.03760.x.

Abstract

The protein-protein subnetwork prediction challenge presented at the 2nd Dialogue for Reverse Engineering Assessments and Methods (DREAM2) conference is an important computational problem essential to proteomic research. Given a set of proteins from the Saccharomyces cerevisiae (baker's yeast) genome, the task is to rank all possible interactions between the proteins from the most likely to the least likely. To tackle this task, we adopt a graph-based strategy to combine multiple sources of biological data and computational predictions. Using training and testing sets extracted from existing yeast protein-protein interactions, we evaluate our method and show that it can produce better predictions than any of the individual data sources. This technique is then used to produce our entry for the protein-protein subnetwork prediction challenge.

MeSH terms

  • Area Under Curve
  • Computational Biology / methods*
  • Databases, Protein
  • Genome, Fungal
  • Models, Genetic
  • Protein Interaction Mapping*
  • ROC Curve
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae / metabolism
  • Saccharomyces cerevisiae Proteins* / genetics
  • Saccharomyces cerevisiae Proteins* / metabolism

Substances

  • Saccharomyces cerevisiae Proteins