Mining the Enriched Subgraphs for Specific Vertices in a Biological Graph

IEEE/ACM Trans Comput Biol Bioinform. 2019 Sep-Oct;16(5):1496-1507. doi: 10.1109/TCBB.2016.2576440. Epub 2016 Jun 7.

Abstract

In this paper, we present a subgroup discovery method to find subgraphs in a graph that are associated with a given set of vertices. The association between a subgraph pattern and a set of vertices is defined by its significant enrichment based on a Bonferroni-corrected hypergeometric probability value. This interestingness measure requires a dedicated pruning procedure to limit the number of subgraph matches that must be calculated. The presented mining algorithm to find associated subgraph patterns in large graphs is therefore designed to efficiently traverse the search space. We demonstrate the operation of this method by applying it on three biological graph data sets and show that we can find associated subgraphs for a biologically relevant set of vertices and that the found subgraphs themselves are biologically interesting.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Data Mining / methods*
  • Gene Regulatory Networks
  • Models, Statistical
  • Protein Interaction Mapping