TGMI: an efficient algorithm for identifying pathway regulators through evaluation of triple-gene mutual interaction

Nucleic Acids Res. 2018 Jun 20;46(11):e67. doi: 10.1093/nar/gky210.

Abstract

Despite their important roles, the regulators for most metabolic pathways and biological processes remain elusive. Presently, the methods for identifying metabolic pathway and biological process regulators are intensively sought after. We developed a novel algorithm called triple-gene mutual interaction (TGMI) for identifying these regulators using high-throughput gene expression data. It first calculated the regulatory interactions among triple gene blocks (two pathway genes and one transcription factor (TF)), using conditional mutual information, and then identifies significantly interacted triple genes using a newly identified novel mutual interaction measure (MIM), which was substantiated to reflect strengths of regulatory interactions within each triple gene block. The TGMI calculated the MIM for each triple gene block and then examined its statistical significance using bootstrap. Finally, the frequencies of all TFs present in all significantly interacted triple gene blocks were calculated and ranked. We showed that the TFs with higher frequencies were usually genuine pathway regulators upon evaluating multiple pathways in plants, animals and yeast. Comparison of TGMI with several other algorithms demonstrated its higher accuracy. Therefore, TGMI will be a valuable tool that can help biologists to identify regulators of metabolic pathways and biological processes from the exploded high-throughput gene expression data in public repositories.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Animals
  • Arabidopsis / genetics
  • Arabidopsis / metabolism*
  • Cell Cycle
  • Embryonic Stem Cells / physiology
  • Gene Expression Profiling / methods
  • Gene Expression Regulation / genetics*
  • Gene Regulatory Networks / genetics*
  • Lignin / biosynthesis
  • Metabolic Networks and Pathways / genetics*
  • Metabolic Networks and Pathways / physiology*
  • Mice
  • Pluripotent Stem Cells / physiology
  • Saccharomyces cerevisiae / genetics
  • Saccharomyces cerevisiae / metabolism*
  • Transcription Factors / biosynthesis

Substances

  • Transcription Factors
  • Lignin