Comparing statistical methods for constructing large scale gene networks

PLoS One. 2012;7(1):e29348. doi: 10.1371/journal.pone.0029348. Epub 2012 Jan 17.

Abstract

The gene regulatory network (GRN) reveals the regulatory relationships among genes and can provide a systematic understanding of molecular mechanisms underlying biological processes. The importance of computer simulations in understanding cellular processes is now widely accepted; a variety of algorithms have been developed to study these biological networks. The goal of this study is to provide a comprehensive evaluation and a practical guide to aid in choosing statistical methods for constructing large scale GRNs. Using both simulation studies and a real application in E. coli data, we compare different methods in terms of sensitivity and specificity in identifying the true connections and the hub genes, the ease of use, and computational speed. Our results show that these algorithms performed reasonably well, and each method has its own advantages: (1) GeneNet, WGCNA (Weighted Correlation Network Analysis), and ARACNE (Algorithm for the Reconstruction of Accurate Cellular Networks) performed well in constructing the global network structure; (2) GeneNet and SPACE (Sparse PArtial Correlation Estimation) performed well in identifying a few connections with high specificity.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Computational Biology / methods*
  • Escherichia coli / genetics
  • Escherichia coli / metabolism
  • Escherichia coli Proteins / genetics
  • Escherichia coli Proteins / metabolism
  • Gene Regulatory Networks*
  • Protein Interaction Mapping / methods*
  • Protein Interaction Maps
  • Reproducibility of Results

Substances

  • Escherichia coli Proteins