Cis-regulatory element based targeted gene finding: genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana

Bioinformatics. 2005 Jul 15;21(14):3074-81. doi: 10.1093/bioinformatics/bti490. Epub 2005 May 12.

Abstract

Motivation: A fundamental problem of computational genomics is identifying the genes that respond to certain endogenous cues and environmental stimuli. This problem can be referred to as targeted gene finding. Since gene regulation is mainly determined by the binding of transcription factors and cis-regulatory DNA sequences, most existing gene annotation methods, which exploit the conservation of open reading frames, are not effective in finding target genes.

Results: A viable approach to targeted gene finding is to exploit the cis-regulatory elements that are known to be responsible for the transcription of target genes. Given such cis-elements, putative target genes whose promoters contain the elements can be identified. As a case study, we apply the above approach to predict the genes in model plant Arabidopsis thaliana which are inducible by a phytohormone, abscisic acid (ABA), and abiotic stress, such as drought, cold and salinity. We first construct and analyze two ABA specific cis-elements, ABA-responsive element (ABRE) and its coupling element (CE), in A.thaliana, based on their conservation in rice and other cereal plants. We then use the ABRE-CE module to identify putative ABA-responsive genes in A.thaliana. Based on RT-PCR verification and the results from literature, this method has an accuracy rate of 67.5% for the top 40 predictions. The cis-element based targeted gene finding approach is expected to be widely applicable since a large number of cis-elements in many species are available.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Validation Study

MeSH terms

  • Abscisic Acid / pharmacology*
  • Arabidopsis / drug effects
  • Arabidopsis / physiology*
  • Arabidopsis Proteins / genetics*
  • Arabidopsis Proteins / physiology*
  • Chromosome Mapping / methods*
  • Enhancer Elements, Genetic
  • Gene Expression Profiling / methods*
  • Gene Targeting / methods*
  • Genes, Regulator / genetics
  • Genome, Plant
  • Open Reading Frames / genetics
  • Oxidative Stress / drug effects
  • Oxidative Stress / physiology*
  • Promoter Regions, Genetic
  • Sequence Alignment / methods
  • Sequence Analysis, DNA / methods

Substances

  • Arabidopsis Proteins
  • Abscisic Acid