Predicting the impact of sequence motifs on gene regulation using single-cell data

Genome Biol. 2023 Aug 15;24(1):189. doi: 10.1186/s13059-023-03021-9.

Abstract

The binding of transcription factors at proximal promoters and distal enhancers is central to gene regulation. Identifying regulatory motifs and quantifying their impact on expression remains challenging. Using a convolutional neural network trained on single-cell data, we infer putative regulatory motifs and cell type-specific importance. Our model, scover, explains 29% of the variance in gene expression in multiple mouse tissues. Applying scover to distal enhancers identified using scATAC-seq from the developing human brain, we identify cell type-specific motif activities in distal enhancers. Scover can identify regulatory motifs and their importance from single-cell data where all parameters and outputs are easily interpretable.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Enhancer Elements, Genetic*
  • Gene Expression Regulation*
  • Humans
  • Mice
  • Neural Networks, Computer
  • Nucleotide Motifs
  • Promoter Regions, Genetic
  • Transcription Factors / genetics
  • Transcription Factors / metabolism

Substances

  • Transcription Factors