Bicistronic and fused monocistronic transcripts are derived from adjacent loci in the Arabidopsis genome

RNA. 2005 Feb;11(2):128-38. doi: 10.1261/rna.7114505.

Abstract

Comparisons of full-length cDNAs and genomic DNAs available for Arabidopsis thaliana described here indicate that some adjacent loci are transcribed into extremely long RNAs spanning two annotated genes. Once expressed, some of these transcripts are post-transcriptionally spliced within their coding and intergenic sequences to generate bicistronic transcripts containing two complete open reading frames. Others are spliced to generate monocistronic transcripts coding for fusion proteins with sequences derived from both loci. RT-PCR of several P450 transcripts in this collection indicates that these extended transcripts exist side by side with shorter monocistronic transcripts derived from the individual loci in each pair. The existence of these unusual transcripts highlights variations in the processes of transcription and splicing that could not possibly have been predicted in the algorithms used for genome annotation and splice site predictions.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Arabidopsis / genetics*
  • Base Sequence
  • Cytochrome P-450 Enzyme System / genetics
  • DNA, Complementary / genetics
  • DNA, Plant / genetics
  • Genes
  • Genome, Plant*
  • Introns
  • Models, Genetic
  • Open Reading Frames
  • RNA Precursors / genetics
  • RNA Splicing
  • RNA, Plant / genetics
  • Transcription, Genetic

Substances

  • DNA, Complementary
  • DNA, Plant
  • RNA Precursors
  • RNA, Plant
  • Cytochrome P-450 Enzyme System