Evidence for widespread existence of functional novel and non-canonical human transcripts

BMC Biol. 2023 Nov 24;21(1):271. doi: 10.1186/s12915-023-01753-5.

Abstract

Background: Fraction of functional sequence in the human genome remains a key unresolved question in Biology and the subject of vigorous debate. While a plethora of studies have connected a significant fraction of human DNA to various biochemical processes, the classical definition of function requires evidence of effects on cellular or organismal fitness that such studies do not provide. Although multiple high-throughput reverse genetics screens have been developed to address this issue, they are limited to annotated genomic elements and suffer from non-specific effects, arguing for a strong need to develop additional functional genomics approaches.

Results: In this work, we established a high-throughput lentivirus-based insertional mutagenesis strategy as a forward genetics screen tool in aneuploid cells. Application of this approach to human cell lines in multiple phenotypic screens suggested the presence of many yet uncharacterized functional elements in the human genome, represented at least in part by novel exons of known and novel genes. The novel transcripts containing these exons can be massively, up to thousands-fold, induced by specific stresses, and at least some can represent bi-cistronic protein-coding mRNAs.

Conclusions: Altogether, these results argue that many unannotated and non-canonical human transcripts, including those that appear as aberrant splice products, have biological relevance under specific biological conditions.

Keywords: Alternative splicing; Bi-cistronic mRNA; Exon prediction; Functional genomics; Insertional mutagenesis; Non-canonical transcript; Novel exon; RNA dark matter; Rapid amplification of cDNA ends; Transcript targeting technique.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing
  • DNA*
  • Exons
  • Genomics* / methods
  • Humans
  • Mutagenesis, Insertional
  • RNA, Messenger / metabolism

Substances

  • RNA, Messenger
  • DNA