Comparative analysis of the base compositions of the pre-mRNA 3' cleaved-off region and the mRNA 3' untranslated region relative to the genomic base composition in animals and plants

PLoS One. 2014 Jun 18;9(6):e99928. doi: 10.1371/journal.pone.0099928. eCollection 2014.

Abstract

The precursor messenger RNA (pre-mRNA) three-prime cleaved-off region (3'COR) and the mRNA three-prime untranslated region (3'UTR) play critical roles in regulating gene expression. The differences in base composition between these regions and the corresponding genomes are still largely uncharacterized in animals and plants. In this study, the base compositions of non-redundant 3'CORs and 3'UTRs were compared with the corresponding whole genomes of eleven animals, four dicotyledonous plants, and three monocotyledonous (cereal) plants. Among the four bases (A, C, G, and U for adenine, cytosine, guanine, and uracil, respectively), U (which corresponds to T, for thymine, in DNA) was the most frequent, A the second most frequent, G the third most frequent, and C the least frequent in most of the species in both the 3'COR and 3'UTR regions. In comparison with the whole genomes, in both regions the U content was usually the most overrepresented (particularly in the monocotyledonous plants), and the C content was the most underrepresented. The order obtained for the species groups, when ranked from high to low according to the U contents in the 3'COR and 3'UTR was as follows: dicotyledonous plants, monocotyledonous plants, non-mammal animals, and mammals. In contrast, the genomic T content was highest in dicotyledonous plants, lowest in monocotyledonous plants, and intermediate in animals. These results suggest the following: 1) there is a mechanism operating in both animals and plants which is biased toward U and against C in the 3'COR and 3'UTR; 2) the 3'UTR and 3'COR, as functional units, minimized the difference between dicotyledonous and monocotyledonous plants, while the dicotyledonous and monocotyledonous genomes evolved into two extreme groups in terms of base composition.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions / genetics*
  • Adenosine / genetics
  • Analysis of Variance
  • Animals
  • Base Composition / genetics*
  • Genome / genetics*
  • Plants / genetics*
  • Polyadenylation / genetics
  • RNA Precursors / genetics*
  • RNA, Messenger / genetics*
  • RNA, Messenger / metabolism
  • Sequence Analysis, RNA
  • Species Specificity

Substances

  • 3' Untranslated Regions
  • RNA Precursors
  • RNA, Messenger
  • Adenosine

Grants and funding

This research was supported by XQL’s funding from Agriculture and Agri-Food Canada. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.