A graph auto-encoder model for miRNA-disease associations prediction

Brief Bioinform. 2021 Jul 20;22(4):bbaa240. doi: 10.1093/bib/bbaa240.

Abstract

Emerging evidence indicates that the abnormal expression of miRNAs involves in the evolution and progression of various human complex diseases. Identifying disease-related miRNAs as new biomarkers can promote the development of disease pathology and clinical medicine. However, designing biological experiments to validate disease-related miRNAs is usually time-consuming and expensive. Therefore, it is urgent to design effective computational methods for predicting potential miRNA-disease associations. Inspired by the great progress of graph neural networks in link prediction, we propose a novel graph auto-encoder model, named GAEMDA, to identify the potential miRNA-disease associations in an end-to-end manner. More specifically, the GAEMDA model applies a graph neural networks-based encoder, which contains aggregator function and multi-layer perceptron for aggregating nodes' neighborhood information, to generate the low-dimensional embeddings of miRNA and disease nodes and realize the effective fusion of heterogeneous information. Then, the embeddings of miRNA and disease nodes are fed into a bilinear decoder to identify the potential links between miRNA and disease nodes. The experimental results indicate that GAEMDA achieves the average area under the curve of $93.56\pm 0.44\%$ under 5-fold cross-validation. Besides, we further carried out case studies on colon neoplasms, esophageal neoplasms and kidney neoplasms. As a result, 48 of the top 50 predicted miRNAs associated with these diseases are confirmed by the database of differentially expressed miRNAs in human cancers and microRNA deregulation in human disease database, respectively. The satisfactory prediction performance suggests that GAEMDA model could serve as a reliable tool to guide the following researches on the regulatory role of miRNAs. Besides, the source codes are available at https://github.com/chimianbuhetang/GAEMDA.

Keywords: complex disease; graph auto-encoder; graph neural networks; heterogeneous graph; miRNA; miRNA-disease associations prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Databases, Genetic*
  • Gene Expression Regulation, Neoplastic*
  • Humans
  • MicroRNAs* / biosynthesis
  • MicroRNAs* / genetics
  • Models, Genetic*
  • Neoplasms* / genetics
  • Neoplasms* / metabolism
  • Neural Networks, Computer*
  • RNA, Neoplasm* / biosynthesis
  • RNA, Neoplasm* / genetics
  • Software*

Substances

  • MicroRNAs
  • RNA, Neoplasm