The Arabidopsis root transcriptome by serial analysis of gene expression. Gene identification using the genome sequence

Plant Physiol. 2004 Jan;134(1):67-80. doi: 10.1104/pp.103.030536.

Abstract

Large-scale identification of genes expressed in roots of the model plant Arabidopsis was performed by serial analysis of gene expression (SAGE), on a total of 144,083 sequenced tags, representing at least 15,964 different mRNAs. For tag to gene assignment, we developed a computational approach based on 26,620 genes annotated from the complete sequence of the genome. The procedure selected warrants the identification of the genes corresponding to the majority of the tags found experimentally, with a high level of reliability, and provides a reference database for SAGE studies in Arabidopsis. This new resource allowed us to characterize the expression of more than 3,000 genes, for which there is no expressed sequence tag (EST) or cDNA in the databases. Moreover, 85% of the tags were specific for one gene. To illustrate this advantage of SAGE for functional genomics, we show that our data allow an unambiguous analysis of most of the individual genes belonging to 12 different ion transporter multigene families. These results indicate that, compared with EST-based tag to gene assignment, the use of the annotated genome sequence greatly improves gene identification in SAGE studies. However, more than 6,000 different tags remained with no gene match, suggesting that a significant proportion of transcripts present in the roots originate from yet unknown or wrongly annotated genes. The root transcriptome characterized in this study markedly differs from those obtained in other organs, and provides a unique resource for investigating the functional specificities of the root system. As an example of the use of SAGE for transcript profiling in Arabidopsis, we report here the identification of 270 genes differentially expressed between roots of plants grown either with NO3- or NH4NO3 as N source.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics*
  • Arabidopsis / growth & development
  • Arabidopsis / metabolism
  • Base Sequence
  • DNA, Plant / genetics
  • Databases, Genetic
  • Expressed Sequence Tags
  • Gene Expression
  • Genome, Plant
  • Ion Channels / genetics
  • Ion Transport / genetics
  • Nitrogen / metabolism
  • Plant Roots / genetics
  • RNA, Messenger / genetics
  • RNA, Plant / genetics

Substances

  • DNA, Plant
  • Ion Channels
  • RNA, Messenger
  • RNA, Plant
  • Nitrogen