Exploiting transfer learning for the reconstruction of the human gene regulatory network

Paolo Mignone; Gianvito Pio; Domenica D'Elia; Michelangelo Ceci

doi:10.1093/bioinformatics/btz781

Exploiting transfer learning for the reconstruction of the human gene regulatory network

Bioinformatics. 2020 Mar 1;36(5):1553-1561. doi: 10.1093/bioinformatics/btz781.

Authors

Paolo Mignone^{1

2}, Gianvito Pio^{1

2}, Domenica D'Elia³, Michelangelo Ceci^{1

2

4}

Affiliations

¹ Department of Computer Science, University of Bari Aldo Moro, Bari 70125, Italy.
² National Interuniversity Consortium for Informatics (CINI), Roma 00185, Italy.
³ Institute for Biomedical Technologies, CNR, Institute for Biomedical Technologies, Bari 70126, Italy.
⁴ Department of Knowledge Technologies, Jožef Stefan Institute, Ljubljana 1000, Slovenia.

PMID: 31608946
DOI: 10.1093/bioinformatics/btz781

Abstract

Motivation: The reconstruction of gene regulatory networks (GRNs) from gene expression data has received increasing attention in recent years, due to its usefulness in the understanding of regulatory mechanisms involved in human diseases. Most of the existing methods reconstruct the network through machine learning approaches, by analyzing known examples of interactions. However, (i) they often produce poor results when the amount of labeled examples is limited, or when no negative example is available and (ii) they are not able to exploit information extracted from GRNs of other (better studied) related organisms, when this information is available.

Results: In this paper, we propose a novel machine learning method that overcomes these limitations, by exploiting the knowledge about the GRN of a source organism for the reconstruction of the GRN of the target organism, by means of a novel transfer learning technique. Moreover, the proposed method is natively able to work in the positive-unlabeled setting, where no negative example is available, by fruitfully exploiting a (possibly large) set of unlabeled examples. In our experiments, we reconstructed the human GRN, by exploiting the knowledge of the GRN of Mus musculus. Results showed that the proposed method outperforms state-of-the-art approaches and identifies previously unknown functional relationships among the analyzed genes.

Availability and implementation: http://www.di.uniba.it/∼mignone/systems/biosfer/index.html.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Animals
Computational Biology
Gene Expression
Gene Expression Profiling
Gene Regulatory Networks*
Humans
Machine Learning
Mice