MPIGeneNet: Parallel Calculation of Gene Co-Expression Networks on Multicore Clusters

IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1732-1737. doi: 10.1109/TCBB.2017.2761340. Epub 2017 Oct 10.

Abstract

In this work, we present MPIGeneNet, a parallel tool that applies Pearson's correlation and Random Matrix Theory to construct gene co-expression networks. It is based on the state-of-the-art sequential tool RMTGeneNet, which provides networks with high robustness and sensitivity at the expenses of relatively long runtimes for large scale input datasets. MPIGeneNet returns the same results as RMTGeneNet but improves the memory management, reduces the I/O cost, and accelerates the two most computationally demanding steps of co-expression network construction by exploiting the compute capabilities of common multicore CPU clusters. Our performance evaluation on two different systems using three typical input datasets shows that MPIGeneNet is significantly faster than RMTGeneNet. As an example, our tool is up to 175.41 times faster on a cluster with eight nodes, each one containing two 12-core Intel Haswell processors. The source code of MPIGeneNet, as well as a reference manual, are available at https://sourceforge.net/projects/mpigenenet/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods*
  • Gene Expression / genetics*
  • Gene Regulatory Networks / genetics*
  • Models, Statistical
  • Sequence Alignment
  • Sequence Analysis, DNA
  • Software*