GraphTGI: an attention-based graph embedding model for predicting TF-target gene interactions

Zhi-Hua Du; Yang-Han Wu; Yu-An Huang; Jie Chen; Gui-Qing Pan; Lun Hu; Zhu-Hong You; Jian-Qiang Li

doi:10.1093/bib/bbac148

GraphTGI: an attention-based graph embedding model for predicting TF-target gene interactions

Brief Bioinform. 2022 May 13;23(3):bbac148. doi: 10.1093/bib/bbac148.

Authors

Zhi-Hua Du¹, Yang-Han Wu¹, Yu-An Huang¹, Jie Chen¹, Gui-Qing Pan¹, Lun Hu¹, Zhu-Hong You², Jian-Qiang Li¹

Affiliations

¹ College of Computer Science and Software Engineering, ShenZhen University, 3688 Nanhai Avenue, Shenzhen, China.
² School of Computer Science, Northwestern Polytechnical University, Xi'an, China.

PMID: 35511108
DOI: 10.1093/bib/bbac148

Abstract

Motivation: Interaction between transcription factor (TF) and its target genes establishes the knowledge foundation for biological researches in transcriptional regulation, the number of which is, however, still limited by biological techniques. Existing computational methods relevant to the prediction of TF-target interactions are mostly proposed for predicting binding sites, rather than directly predicting the interactions. To this end, we propose here a graph attention-based autoencoder model to predict TF-target gene interactions using the information of the known TF-target gene interaction network combined with two sequential and chemical gene characters, considering that the unobserved interactions between transcription factors and target genes can be predicted by learning the pattern of the known ones. To the best of our knowledge, the proposed model is the first attempt to solve this problem by learning patterns from the known TF-target gene interaction network.

Results: In this paper, we formulate the prediction task of TF-target gene interactions as a link prediction problem on a complex knowledge graph and propose a deep learning model called GraphTGI, which is composed of a graph attention-based encoder and a bilinear decoder. We evaluated the prediction performance of the proposed method on a real dataset, and the experimental results show that the proposed model yields outstanding performance with an average AUC value of 0.8864 +/- 0.0057 in the 5-fold cross-validation. It is anticipated that the GraphTGI model can effectively and efficiently predict TF-target gene interactions on a large scale.

Availability: Python code and the datasets used in our studies are made available at https://github.com/YanghanWu/GraphTGI.

Keywords: chemical similarity; graph auto-encoder; graph neural network; transcription factor; transcriptional regulatory network.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Neural Networks, Computer*