Mining the modular structure of protein interaction networks

PLoS One. 2015 Apr 9;10(4):e0122477. doi: 10.1371/journal.pone.0122477. eCollection 2015.

Abstract

Background: Cluster-based descriptions of biological networks have received much attention in recent years fostered by accumulated evidence of the existence of meaningful correlations between topological network clusters and biological functional modules. Several well-performing clustering algorithms exist to infer topological network partitions. However, due to respective technical idiosyncrasies they might produce dissimilar modular decompositions of a given network. In this contribution, we aimed to analyze how alternative modular descriptions could condition the outcome of follow-up network biology analysis.

Methodology: We considered a human protein interaction network and two paradigmatic cluster recognition algorithms, namely: the Clauset-Newman-Moore and the infomap procedures. We analyzed to what extent both methodologies yielded different results in terms of granularity and biological congruency. In addition, taking into account Guimera's cartographic role characterization of network nodes, we explored how the adoption of a given clustering methodology impinged on the ability to highlight relevant network meso-scale connectivity patterns.

Results: As a case study we considered a set of aging related proteins and showed that only the high-resolution modular description provided by infomap, could unveil statistically significant associations between them and inter/intra modular cartographic features. Besides reporting novel biological insights that could be gained from the discovered associations, our contribution warns against possible technical concerns that might affect the tools used to mine for interaction patterns in network biology studies. In particular our results suggested that sub-optimal partitions from the strict point of view of their modularity levels might still be worth being analyzed when meso-scale features were to be explored in connection with external source of biological knowledge.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aging / genetics*
  • Algorithms*
  • Cluster Analysis
  • Data Mining / statistics & numerical data*
  • Gene Regulatory Networks*
  • Humans
  • Protein Interaction Mapping*
  • Protein Interaction Maps

Grants and funding

CONICET (grant PIP0087), UBACyT (grant 20020110200314), ISCIII-FEDER (PI13/00082 and CP10/00524), IMI JU (grant agreements n° [115002] (eTOX) and n° [115191] (Open PHACTS)], resources of which are composed of financial contribution from the EU's FP7 (FP7/2007–2013) and EFPIA companies’ in kind contribution). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.