Inferring Gene Regulatory Networks via Ensemble Path Consistency Algorithm Based on Conditional Mutual Information

IEEE/ACM Trans Comput Biol Bioinform. 2023 May-Jun;20(3):1807-1816. doi: 10.1109/TCBB.2022.3220581. Epub 2023 Jun 5.

Abstract

Inferring gene regulatory networks from gene expression data has received great attention because such networks can reveal complex life phenomena through the interaction mechanisms among their nodes. However, the reconstruction of large-scale gene regulatory networks is often unsatisfactory because of the curse of dimensionality and the impact of external noise. To address this problem, we introduce a novel algorithm, the ensemble path consistency algorithm based on conditional mutual information (EPCACMI), whose mutual information threshold is dynamically self-adjusted. We first use principal component analysis to decompose a large-scale network into several subnetworks. Then, according to the absolute value of the coefficients of each principal component, we remove a large number of unrelated nodes from every subnetwork and infer the relationships among the selected nodes. Finally, all inferred subnetworks are integrated to form the structure of the complete network. Because the whole network is not inferred directly, the influence of a large amount of redundant noise is weakened. Compared with related algorithms such as MRNET, ARACNE, PCAPMI and PCACMI, the results show that EPCACMI is more effective and more robust when inferring gene regulatory networks with larger numbers of nodes.
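The three steps described above (decomposition by principal component analysis, inference within each subnetwork, integration) can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions that the abstract does not state: conditional mutual information is estimated under a Gaussian assumption from covariance determinants, the threshold is a fixed value rather than the dynamically self-adjusted one, only zero-order and first-order conditioning are applied, an edge is dropped when its largest first-order value over common neighbours falls below the threshold, and subnetworks are integrated by taking the union of their edges. All function and parameter names (gaussian_cmi, pca_subnetworks, infer_subnetwork, infer_network, top_k, threshold) are hypothetical.

```python
import itertools
import numpy as np

def gaussian_cmi(x, y, z=None):
    """I(X;Y|Z) under a Gaussian assumption, from covariance determinants."""
    def det_cov(*cols):
        cov = np.atleast_2d(np.cov(np.column_stack(cols), rowvar=False))
        return np.linalg.det(cov)
    if z is None:  # zero order: plain mutual information
        return 0.5 * np.log(det_cov(x) * det_cov(y) / det_cov(x, y))
    return 0.5 * np.log(det_cov(x, z) * det_cov(y, z)
                        / (det_cov(z) * det_cov(x, y, z)))

def pca_subnetworks(expr, n_components, top_k):
    """expr is samples x genes. For each principal component, keep the
    top_k genes with the largest absolute coefficient (assumed rule)."""
    centered = expr - expr.mean(axis=0)
    # Rows of vt hold the principal-component coefficients over genes.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return [np.sort(np.argsort(-np.abs(vt[c]))[:top_k])
            for c in range(n_components)]

def infer_subnetwork(expr, genes, threshold):
    """Zero-order and first-order path-consistency pass on one subnetwork."""
    genes = [int(g) for g in genes]
    # Order 0: keep pairs whose mutual information exceeds the threshold.
    edges = {(a, b) for a, b in itertools.combinations(genes, 2)
             if gaussian_cmi(expr[:, a], expr[:, b]) > threshold}
    neighbours = {g: set() for g in genes}
    for a, b in edges:
        neighbours[a].add(b)
        neighbours[b].add(a)
    # Order 1: condition each edge on every common neighbour in turn.
    kept = set()
    for a, b in edges:
        common = neighbours[a] & neighbours[b]
        cmis = [gaussian_cmi(expr[:, a], expr[:, b], expr[:, k])
                for k in common]
        if not common or max(cmis) > threshold:
            kept.add((a, b))
    return kept

def infer_network(expr, n_components=2, top_k=4, threshold=0.05):
    """Integrate subnetworks by taking the union of their edges (assumed)."""
    network = set()
    for genes in pca_subnetworks(expr, n_components, top_k):
        network |= infer_subnetwork(expr, genes, threshold)
    return network

if __name__ == "__main__":
    # Synthetic chain g0 -> g1 -> g2 plus five unrelated noise genes.
    rng = np.random.default_rng(0)
    g0 = rng.normal(size=400)
    g1 = g0 + 0.5 * rng.normal(size=400)
    g2 = g1 + 0.5 * rng.normal(size=400)
    expr = np.column_stack([g0, g1, g2, rng.normal(size=(400, 5))])
    print(sorted(infer_network(expr, n_components=1, top_k=4)))
```

In the synthetic chain, genes 0 and 2 are strongly correlated, yet conditioning on gene 1 drives their conditional mutual information close to zero, so the indirect edge is the one removed by the first-order pass. A faithful reproduction would replace the fixed threshold and the first-order limit with the rules given in the full paper.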

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology* / methods
  • Gene Regulatory Networks* / genetics
  • Principal Component Analysis