Identification and functional analyses of 11,769 full-length human cDNAs focused on alternative splicing

DNA Res. 2009 Dec;16(6):371-83. doi: 10.1093/dnares/dsp022. Epub 2009 Oct 30.

Abstract

We analyzed diversity of mRNA produced as a result of alternative splicing in order to evaluate gene function. First, we predicted the number of human genes transcribed into protein-coding mRNAs by using the sequence information of full-length cDNAs and 5'-ESTs and obtained 23 241 of such human genes. Next, using these genes, we analyzed the mRNA diversity and consequently sequenced and identified 11 769 human full-length cDNAs whose predicted open reading frames were different from other known full-length cDNAs. Especially, 30% of the cDNAs we identified contained variation in the transcription start site (TSS). Our analysis, which particularly focused on multiple variable first exons (FEVs) formed due to the alternative utilization of TSSs, led to the identification of 261 FEVs expressed in the tissue-specific manner. Quantification of the expression profiles of 13 genes by real-time PCR analysis further confirmed the tissue-specific expression of FEVs, e.g. OXR1 had specific TSS in brain and tumor tissues, and so on. Finally, based on the results of our mRNA diversity analysis, we have created the FLJ Human cDNA Database. From our result, it has been understood mechanisms that one gene produces suitable protein-coding transcripts responding to the situation and the environment.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Chromosome Mapping
  • Computational Biology / methods
  • DNA, Complementary / genetics*
  • DNA, Complementary / metabolism*
  • Databases, Genetic
  • Exons
  • Expressed Sequence Tags
  • Genetic Variation
  • Humans
  • Organ Specificity
  • Proteins* / genetics
  • Proteins* / metabolism
  • RNA, Messenger* / genetics
  • RNA, Messenger* / metabolism
  • Sequence Analysis, DNA
  • Structure-Activity Relationship
  • Transcription Initiation Site

Substances

  • DNA, Complementary
  • Proteins
  • RNA, Messenger