MinePath: Mining for Phenotype Differential Sub-paths in Molecular Pathways

PLoS Comput Biol. 2016 Nov 10;12(11):e1005187. doi: 10.1371/journal.pcbi.1005187. eCollection 2016 Nov.

Abstract

Pathway analysis methodologies couple traditional gene expression analysis with knowledge encoded in established molecular pathway networks, offering a promising approach towards the biological interpretation of phenotype differentiating genes. Early pathway analysis methodologies, named as gene set analysis (GSA), view pathways just as plain lists of genes without taking into account either the underlying pathway network topology or the involved gene regulatory relations. These approaches, even if they achieve computational efficiency and simplicity, consider pathways that involve the same genes as equivalent in terms of their gene enrichment characteristics. Most recent pathway analysis approaches take into account the underlying gene regulatory relations by examining their consistency with gene expression profiles and computing a score for each profile. Even with this approach, assessing and scoring single-relations limits the ability to reveal key gene regulation mechanisms hidden in longer pathway sub-paths. We introduce MinePath, a pathway analysis methodology that addresses and overcomes the aforementioned problems. MinePath facilitates the decomposition of pathways into their constituent sub-paths. Decomposition leads to the transformation of single-relations to complex regulation sub-paths. Regulation sub-paths are then matched with gene expression sample profiles in order to evaluate their functional status and to assess phenotype differential power. Assessment of differential power supports the identification of the most discriminant profiles. In addition, MinePath assess the significance of the pathways as a whole, ranking them by their p-values. Comparison results with state-of-the-art pathway analysis systems are indicative for the soundness and reliability of the MinePath approach. In contrast with many pathway analysis tools, MinePath is a web-based system (www.minepath.org) offering dynamic and rich pathway visualization functionality, with the unique characteristic to color regulatory relations between genes and reveal their phenotype inclination. This unique characteristic makes MinePath a valuable tool for in silico molecular biology experimentation as it serves the biomedical researchers' exploratory needs to reveal and interpret the regulatory mechanisms that underlie and putatively govern the expression of target phenotypes.

MeSH terms

  • Algorithms
  • Computer Simulation
  • Data Mining / methods*
  • Databases, Genetic*
  • Gene Expression Profiling / methods*
  • Models, Biological*
  • Proteome / genetics
  • Proteome / metabolism*
  • Signal Transduction / physiology*
  • Software

Substances

  • Proteome

Grants and funding

This work has been partially supported by the European Union and Greek national funds through the National Strategic Reference Framework (NSRF)—Research Funding Program Heracleitus II and the FP7-ICT-2009.5.3, No 270089 P-Medicine project. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.