Network subgraph-based approach for analyzing and comparing molecular networks

PeerJ. 2022 May 3:10:e13137. doi: 10.7717/peerj.13137. eCollection 2022.

Abstract

Molecular networks are built up from genetic elements that exhibit feedback interactions. Here, we studied the problem of measuring the similarity of directed networks by proposing a novel alignment-free approach: the network subgraph-based approach. Our approach does not make use of randomized networks to determine modular patterns embedded in a network, and this method differs from the network motif and graphlet methods. Network similarity was quantified by gauging the difference between the subgraph frequency distributions of two networks using Jensen-Shannon entropy. We applied the subgraph approach to study three types of molecular networks, i.e., cancer networks, signal transduction networks, and cellular process networks, which exhibit diverse molecular functions. We compared the performance of our subgraph detection algorithm with other algorithms, and the results were consistent, but other algorithms could not address the issue of subgraphs/motifs embedded within a subgraph/motif. To evaluate the effectiveness of the subgraph-based method, we applied the method along with the Jensen-Shannon entropy to classify six network models, and it achieves a 100% accuracy of classification. The proposed information-theoretic approach allows us to determine the structural similarity of two networks regardless of node identity and network size. We demonstrated the effectiveness of the subgraph approach to cluster molecular networks that exhibit similar regulatory interaction topologies. As an illustration, our method can identify (i) common subgraph-mediated signal transduction and/or cellular processes in AML and pancreatic cancer, and (ii) scaffold proteins in gastric cancer and hepatocellular carcinoma; thus, the results suggested that there are common regulation modules for cancer formation. We also found that the underlying substructures of the molecular networks are dominated by irreducible subgraphs; this feature is valid for the three classes of molecular networks we studied. The subgraph-based approach provides a systematic scenario for analyzing, compare and classifying molecular networks with diverse functionalities.

Keywords: Cancer networks; Cellular processes; Entropy; Graph theory; Information theory; Jensen-Shannon entropy; Network subgraphs; Signal transduction networks.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Humans
  • Neoplasms*
  • Proteins / chemistry
  • Signal Transduction / physiology

Substances

  • Proteins

Grants and funding

The work of Chien-Hung Huang is supported by the grant of the Ministry of Science and Technology of Taiwan (MOST) (grant number MOST 109-2221-E-150-036). Dr. Ka-Lok Ng and Efendi Zaenudin works are supported by the MOST (grant numbers MOST 109-2221-E-468-013 and MOST 108-2221-E-468-020) and grants from Asia University (grant number ASIA-110-CMUH-12 ). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.