CNCDatabase: a database of non-coding cancer drivers

Nucleic Acids Res. 2021 Jan 8;49(D1):D1094-D1101. doi: 10.1093/nar/gkaa915.

Abstract

Most mutations in cancer genomes occur in non-coding regions, where their impact on tumor development is unknown. Although the growing number of cancer whole-genome sequences has revealed numerous putative non-coding cancer drivers, information about them is dispersed across multiple studies, making it difficult to understand their roles in the tumorigenesis of different cancer types. We have developed CNCDatabase, the Cornell Non-coding Cancer driver Database (https://cncdatabase.med.cornell.edu/), which contains detailed information about predicted non-coding drivers at gene promoters, 5' and 3' untranslated regions (UTRs), enhancers, CTCF insulators and non-coding RNAs. CNCDatabase documents 1111 protein-coding genes and 90 non-coding RNAs with reported drivers in their non-coding regions from 32 cancer types. These drivers are supported by computational predictions of positive selection using whole-genome sequences; by differential gene expression in samples with and without mutations; or by experimental validations including luciferase reporter assays and genome editing. The database can be easily modified and scaled as lists of non-coding drivers are revised by the community with larger whole-genome sequencing studies, CRISPR screens and further experimental validations. Overall, CNCDatabase provides a helpful resource for researchers exploring the pathological role of non-coding alterations in human cancers.
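
The abstract names six classes of non-coding element and three kinds of supporting evidence. As a minimal sketch of how such entries could be modeled and filtered, the Python below uses those categories as enumerations. It is an illustration only: the class names, field names and `filter_records` helper are hypothetical, the rows are placeholders rather than real database entries, and nothing here reflects the actual CNCDatabase schema or interface.

```python
from dataclasses import dataclass
from enum import Enum


class ElementType(Enum):
    """Non-coding element classes listed in the abstract."""
    PROMOTER = "promoter"
    UTR5 = "5' UTR"
    UTR3 = "3' UTR"
    ENHANCER = "enhancer"
    CTCF_INSULATOR = "CTCF insulator"
    NCRNA = "non-coding RNA"


class Evidence(Enum):
    """Kinds of supporting evidence listed in the abstract."""
    POSITIVE_SELECTION = "computational prediction of positive selection"
    DIFFERENTIAL_EXPRESSION = "differential gene expression"
    EXPERIMENTAL_VALIDATION = "experimental validation"


@dataclass(frozen=True)
class DriverRecord:
    """Hypothetical record layout; not the real CNCDatabase schema."""
    gene: str
    element: ElementType
    cancer_type: str
    evidence: Evidence


def filter_records(records, element=None, evidence=None):
    """Return records matching the given element and/or evidence type.

    A filter left as None is not applied.
    """
    return [
        r for r in records
        if (element is None or r.element is element)
        and (evidence is None or r.evidence is evidence)
    ]


# Placeholder rows for illustration only; these are NOT real database entries.
EXAMPLE_RECORDS = [
    DriverRecord("GENE_A", ElementType.PROMOTER, "cancer type 1",
                 Evidence.POSITIVE_SELECTION),
    DriverRecord("GENE_A", ElementType.PROMOTER, "cancer type 2",
                 Evidence.EXPERIMENTAL_VALIDATION),
    DriverRecord("GENE_B", ElementType.UTR3, "cancer type 1",
                 Evidence.DIFFERENTIAL_EXPRESSION),
    DriverRecord("RNA_C", ElementType.NCRNA, "cancer type 3",
                 Evidence.POSITIVE_SELECTION),
]

if __name__ == "__main__":
    # List placeholder promoter entries backed by a positive-selection prediction.
    for rec in filter_records(EXAMPLE_RECORDS,
                              element=ElementType.PROMOTER,
                              evidence=Evidence.POSITIVE_SELECTION):
        print(rec.gene, rec.cancer_type)
```

Keeping the element class and the evidence type as separate fields lets the same gene appear once per cancer type and evidence source, which matches the abstract's description of drivers reported across multiple studies.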

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • 3' Untranslated Regions
  • 5' Untranslated Regions
  • Carcinogenesis / genetics*
  • Carcinogenesis / metabolism
  • Carcinogenesis / pathology
  • Clustered Regularly Interspaced Short Palindromic Repeats
  • Databases, Genetic*
  • Enhancer Elements, Genetic
  • Gene Expression Regulation, Neoplastic*
  • Genes, Reporter
  • Genome, Human*
  • Humans
  • Insulator Elements
  • Luciferases / genetics
  • Luciferases / metabolism
  • Mutation
  • Neoplasms / genetics*
  • Neoplasms / metabolism
  • Neoplasms / pathology
  • Open Reading Frames
  • Promoter Regions, Genetic
  • RNA, Untranslated / classification
  • RNA, Untranslated / genetics
  • RNA, Untranslated / metabolism
  • Untranslated Regions
  • Whole Genome Sequencing

Substances

  • 3' Untranslated Regions
  • 5' Untranslated Regions
  • RNA, Untranslated
  • Untranslated Regions
  • Luciferases