ZEBRA: a hierarchically integrated gene expression atlas of the murine and human brain at single-cell resolution

Nucleic Acids Res. 2024 Jan 5;52(D1):D1089-D1096. doi: 10.1093/nar/gkad990.

Abstract

The molecular causes and mechanisms of neurodegenerative diseases remain poorly understood. A growing number of single-cell studies have implicated various neural, glial, and immune cell subtypes to affect the mammalian central nervous system in many age-related disorders. Integrating this body of transcriptomic evidence into a comprehensive and reproducible framework poses several computational challenges. Here, we introduce ZEBRA, a large single-cell and single-nucleus RNA-seq database. ZEBRA integrates and normalizes gene expression and metadata from 33 studies, encompassing 4.2 million human and mouse brain cells sampled from 39 brain regions. It incorporates samples from patients with neurodegenerative diseases like Alzheimer's disease, Parkinson's disease, and Multiple sclerosis, as well as samples from relevant mouse models. We employed scVI, a deep probabilistic auto-encoder model, to integrate the samples and curated both cell and sample metadata for downstream analysis. ZEBRA allows for cell-type and disease-specific markers to be explored and compared between sample conditions and brain regions, a cell composition analysis, and gene-wise feature mappings. Our comprehensive molecular database facilitates the generation of data-driven hypotheses, enhancing our understanding of mammalian brain function during aging and disease. The data sets, along with an interactive database are freely available at https://www.ccb.uni-saarland.de/zebra.

MeSH terms

  • Alzheimer Disease / metabolism
  • Animals
  • Brain / metabolism
  • Gene Expression
  • Humans
  • Mice
  • Neurodegenerative Diseases* / genetics
  • Parkinson Disease / metabolism
  • Single-Cell Analysis*
  • Transcriptome