Use of data-biased random walks on graphs for the retrieval of context-specific networks from genomic data

PLoS Comput Biol. 2010 Aug 19;6(8):e1000889. doi: 10.1371/journal.pcbi.1000889.

Abstract

Extracting network-based functional relationships within genomic datasets is an important challenge in the computational analysis of large-scale data. Although many methods, both public and commercial, have been developed, the problem of identifying networks of interactions that are most relevant to the given input data still remains an open issue. Here, we have leveraged the method of random walks on graphs as a powerful platform for scoring network components based on simultaneous assessment of the experimental data as well as local network connectivity. Using this method, NetWalk, we can calculate distribution of Edge Flux values associated with each interaction in the network, which reflects the relevance of interactions based on the experimental data. We show that network-based analyses of genomic data are simpler and more accurate using NetWalk than with some of the currently employed methods. We also present NetWalk analysis of microarray gene expression data from MCF7 cells exposed to different doses of doxorubicin, which reveals a switch-like pattern in the p53 regulated network in cell cycle arrest and apoptosis. Our analyses demonstrate the use of NetWalk as a valuable tool in generating high-confidence hypotheses from high-content genomic data.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Apoptosis / drug effects
  • Apoptosis / genetics
  • Breast Neoplasms / drug therapy
  • Breast Neoplasms / genetics*
  • Cell Cycle / drug effects
  • Cell Cycle / genetics
  • Cell Line, Tumor
  • Computational Biology / methods
  • Doxorubicin / therapeutic use
  • Female
  • Gene Expression Profiling / methods
  • Gene Regulatory Networks*
  • Genes, p53
  • Genomics / methods*
  • Genomics / statistics & numerical data*
  • Humans
  • Models, Biological
  • Oligonucleotide Array Sequence Analysis
  • Stochastic Processes

Substances

  • Doxorubicin