iPiDA-GCN: Identification of piRNA-disease associations based on Graph Convolutional Network

PLoS Comput Biol. 2022 Oct 27;18(10):e1010671. doi: 10.1371/journal.pcbi.1010671. eCollection 2022 Oct.

Abstract

Motivation: Piwi-interacting RNAs (piRNAs) play a critical role in the progression of various diseases. Accurately identifying the associations between piRNAs and diseases is important for diagnosing and prognosticating diseases. Although some computational methods have been proposed to detect piRNA-disease associations, it is challenging for these methods to effectively capture nonlinear and complex relationships between piRNAs and diseases because of the limited training data and insufficient association representation.

Results: With the growth of piRNA-disease association data, it is possible to design a more complex machine learning method to solve this problem. In this study, we propose a computational method called iPiDA-GCN for piRNA-disease association identification based on graph convolutional networks (GCNs). The iPiDA-GCN predictor constructs the graphs based on piRNA sequence information, disease semantic information and known piRNA-disease associations. Two GCNs (Asso-GCN and Sim-GCN) are used to extract the features of both piRNAs and diseases by capturing the association patterns from piRNA-disease interaction network and two similarity networks. GCNs can capture complex network structure information from these networks, and learn discriminative features. Finally, the full connection networks and inner production are utilized as the output module to predict piRNA-disease association scores. Experimental results demonstrate that iPiDA-GCN achieves better performance than the other state-of-the-art methods, benefitted from the discriminative features extracted by Asso-GCN and Sim-GCN. The iPiDA-GCN predictor is able to detect new piRNA-disease associations to reveal the potential pathogenesis at the RNA level. The data and source code are available at http://bliulab.net/iPiDA-GCN/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • RNA, Small Interfering / genetics
  • Software*
  • Support Vector Machine*

Substances

  • RNA, Small Interfering

Grants and funding

This work was supported by the National Key R&D Program of China (No. 2018AAA0100100) and the National Natural Science Foundation of China (No. 62271049) to (BL). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.