Integrating multi-network topology for gene function prediction using deep neural networks

Brief Bioinform. 2021 Mar 22;22(2):2096-2105. doi: 10.1093/bib/bbaa036.

Abstract

Motivation: The emergence of abundant biological networks, which benefit from the development of advanced high-throughput techniques, contributes to describing and modeling complex internal interactions among biological entities such as genes and proteins. Multiple networks provide rich information for inferring the function of genes or proteins. To extract functional patterns of genes based on multiple heterogeneous networks, network embedding-based methods, aiming to capture non-linear and low-dimensional feature representation based on network biology, have recently achieved remarkable performance in gene function prediction. However, existing methods do not consider the shared information among different networks during the feature learning process.

Results: Taking the correlation among the networks into account, we design a novel semi-supervised autoencoder method to integrate multiple networks and generate a low-dimensional feature representation. Then we utilize a convolutional neural network based on the integrated feature embedding to annotate unlabeled gene functions. We test our method on both yeast and human datasets and compare with three state-of-the-art methods. The results demonstrate the superior performance of our method. We not only provide a comprehensive analysis of the performance of the newly proposed algorithm but also provide a tool for extracting features of genes based on multiple networks, which can be used in the downstream machine learning task.

Availability: DeepMNE-CNN is freely available at https://github.com/xuehansheng/DeepMNE-CNN.

Contact: jiajiepeng@nwpu.edu.cn; shang@nwpu.edu.cn; jianye.hao@tju.edu.cn.

Keywords: function prediction; multiple networks; network embedding.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Deep Learning*
  • Gene Regulatory Networks
  • Genes, Fungal
  • Humans
  • Molecular Sequence Annotation
  • Neural Networks, Computer*
  • Yeasts / genetics