CellGO: a novel deep learning-based framework and webserver for cell-type-specific gene function interpretation

Brief Bioinform. 2023 Nov 22;25(1):bbad417. doi: 10.1093/bib/bbad417.

Abstract

Interpreting the function of genes and gene sets identified from omics experiments remains a challenge, as current pathway analysis tools often fail to consider the critical biological context, such as tissue or cell-type specificity. To address this limitation, we introduced CellGO. CellGO tackles this challenge by leveraging the visible neural network (VNN) and single-cell gene expressions to mimic cell-type-specific signaling propagation along the Gene Ontology tree within a cell. This design enables a novel scoring system to calculate the cell-type-specific gene-pathway paired active scores, based on which, CellGO is able to identify cell-type-specific active pathways associated with single genes. In addition, by aggregating the activities of single genes, CellGO extends its capability to identify cell-type-specific active pathways for a given gene set. To enhance biological interpretation, CellGO offers additional features, including the identification of significantly active cell types and driver genes and community analysis of pathways. To validate its performance, CellGO was assessed using a gene set comprising mixed cell-type markers, confirming its ability to discern active pathways across distinct cell types. Subsequent benchmarking analyses demonstrated CellGO's superiority in effectively identifying cell types and their corresponding cell-type-specific pathways affected by gene knockouts, using either single genes or sets of genes differentially expressed between knockout and control samples. Moreover, CellGO demonstrated its ability to infer cell-type-specific pathogenesis for disease risk genes. Accessible as a Python package, CellGO also provides a user-friendly web interface, making it a versatile and accessible tool for researchers in the field.

Keywords: Gene Ontology; artificial neural network; cell-type-specific pathway analysis; disease risk genes; gene functional analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Deep Learning*
  • Disease Susceptibility
  • Humans
  • Software*