A novel heterophilic graph diffusion convolutional network for identifying cancer driver genes

Brief Bioinform. 2023 May 19;24(3):bbad137. doi: 10.1093/bib/bbad137.

Abstract

Identifying cancer driver genes plays a curial role in the development of precision oncology and cancer therapeutics. Although a plethora of methods have been developed to tackle this problem, the complex cancer mechanisms and intricate interactions between genes still make the identification of cancer driver genes challenging. In this work, we propose a novel machine learning method of heterophilic graph diffusion convolutional networks (called HGDCs) to boost cancer-driver gene identification. Specifically, HGDC first introduces graph diffusion to generate an auxiliary network for capturing the structurally similar nodes in a biomolecular network. Then, HGDC designs an improved message aggregation and propagation scheme to adapt to the heterophilic setting of biomolecular networks, alleviating the problem of driver gene features being smoothed by its neighboring dissimilar genes. Finally, HGDC uses a layer-wise attention classifier to predict the probability of one gene being a cancer driver gene. In the comparison experiments with other existing state-of-the-art methods, our HGDC achieves outstanding performance in identifying cancer driver genes. The experimental results demonstrate that HGDC not only effectively identifies well-known driver genes on different networks but also novel candidate cancer genes. Moreover, HGDC can effectively prioritize cancer driver genes for individual patients. Particularly, HGDC can identify patient-specific additional driver genes, which work together with the well-known driver genes to cooperatively promote tumorigenesis.

Keywords: cancer driver genes; graph convolutional networks; graph diffusion; heterophilic networks; patient-specific driver genes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Transformation, Neoplastic / genetics
  • Gene Regulatory Networks
  • Humans
  • Neoplasms* / genetics
  • Oncogenes
  • Precision Medicine