Resource: A Curated Database of Brain-Related Functional Gene Sets (Brain.GMT)

bioRxiv [Preprint]. 2024 Apr 10:2024.04.05.588301. doi: 10.1101/2024.04.05.588301.

Abstract

Transcriptional profiling has become a common tool for investigating the nervous system. During analysis, differential expression results are often compared to functional ontology databases, which contain curated gene sets representing well-studied pathways. This dependence can cause neuroscience studies to be interpreted in terms of functional pathways documented in better studied tissues (e.g., liver) and topics (e.g., cancer), and systematically emphasizes well-studied genes, leaving other findings in the obscurity of the brain "ignorome". To address this issue, we compiled a curated database of 918 gene sets related to nervous system function, tissue, and cell types ("Brain.GMT") that can be used within common analysis pipelines (GSEA, limma, edgeR) to interpret results from three species (rat, mouse, human). Brain.GMT includes brain-related gene sets curated from the Molecular Signatures Database (MSigDB) and extracted from public databases (GeneWeaver, Gemma, DropViz, BrainInABlender, HippoSeq) and published studies containing differential expression results. Although Brain.GMT is still undergoing development and currently only represents a fraction of available brain gene sets, "brain ignorome" genes are already better represented than in traditional Gene Ontology databases. Moreover, Brain.GMT substantially improves the quantity and quality of gene sets identified as enriched with differential expression in neuroscience studies, enhancing interpretation.

Keywords: Central Nervous System; Differential Expression Analysis; Frontal Cortex; Gene Set Enrichment Analysis (GSEA); Genomics; Hippocampus; Microarray; Nucleus Accumbens; RNA-Seq; Transcriptional Profiling.

Publication types

  • Preprint