Analysis of human upstream open reading frames and impact on gene expression

Hum Genet. 2015 Jun;134(6):605-12. doi: 10.1007/s00439-015-1544-7. Epub 2015 Mar 24.

Abstract

The upstream open reading frame (uORF) is a post-transcriptional regulatory element in the 5' untranslated region (5'UTR), which modulates the translation levels of main open reading frame (mORF). Earlier studies showed that disturbed uORF-mediated translation control can result in drastic changes in translation levels of mORF, leading to genetic disorders. To date, there has been no systematic investigation into the relationship between variations in patients and uORF status. Here, taking the advantage of several datasets, including gene ontology (GO) annotations and sequence feature analysis, we have examined uORF impacts in human transcripts. GO annotations indicate that uORF-containing genes are enriched in certain features such as oncogenes and transcription factors. Sequence feature analysis reveals that uORF is a factor for determination of the translation initiation site (TIS) in human transcripts. We show that genes with uORFs have lower protein expression levels than genes without uORFs in multiple human tissues. Moreover, by examining three disease variation databases, we identified uORF-altering mutations from a total of 3,740,225 variations, which are highly suspected to be associated with changed levels of gene expression. For an experimental validation, we found four mutations with significant effects on protein expression but with only modest changes in transcription levels. These findings will provide researchers on related diseases with new insights into the importance of known mutations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions / genetics*
  • Cell Line
  • Databases, Genetic
  • Gene Expression Regulation / genetics*
  • Humans
  • Mutation*
  • Open Reading Frames / genetics*
  • Response Elements / genetics*

Substances

  • 5' Untranslated Regions