NetMODE: network motif detection without Nauty

PLoS One. 2012;7(12):e50093. doi: 10.1371/journal.pone.0050093. Epub 2012 Dec 18.

Abstract

A motif in a network is a connected graph that occurs significantly more frequently as an induced subgraph than would be expected in a similar randomized network. By virtue of being atypical, it is thought that motifs might play a more important role than arbitrary subgraphs. Recently, a flurry of advances in the study of network motifs has created demand for faster computational means for identifying motifs in increasingly larger networks. Motif detection is typically performed by enumerating subgraphs in an input network and in an ensemble of comparison networks; this poses a significant computational problem. Classifying the subgraphs encountered, for instance, is typically performed using a graph canonical labeling package, such as Nauty, and will typically be called billions of times. In this article, we describe an implementation of a network motif detection package, which we call NetMODE. NetMODE can only perform motif detection for [Formula: see text]-node subgraphs when [Formula: see text], but does so without the use of Nauty. To avoid using Nauty, NetMODE has an initial pretreatment phase, where [Formula: see text]-node graph data is stored in memory ([Formula: see text]). For [Formula: see text] we take a novel approach, which relates to the Reconstruction Conjecture for directed graphs. We find that NetMODE can perform up to around [Formula: see text] times faster than its predecessors when [Formula: see text] and up to around [Formula: see text] times faster when [Formula: see text] (the exact improvement varies considerably). NetMODE also (a) includes a method for generating comparison graphs uniformly at random, (b) can interface with external packages (e.g. R), and (c) can utilize multi-core architectures. NetMODE is available from netmode.sf.net.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computer Communication Networks
  • Computer Simulation
  • Computers
  • Models, Biological
  • Models, Statistical
  • Reproducibility of Results
  • Software*

Grants and funding

This work was supported in part by NSFC of China (60903028, 61070014), Key Projects in the Tianjin Science & Technology Pillar Program (11ZCKFGX01100). Stones was also supported by an ARC grant. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.