Web-queryable large-scale data sets for hypothesis generation in plant biology

Plant Cell. 2009 Apr;21(4):1034-51. doi: 10.1105/tpc.109.066050. Epub 2009 Apr 28.

Abstract

The approaching end of the 21st century's first decade marks an exciting time for plant biology. Several National Science Foundation Arabidopsis 2010 Projects will conclude, and whether or not the stated goal of the National Science Foundation 2010 Program-to determine the function of 25,000 Arabidopsis genes by 2010-is reached, these projects and others in a similar vein, such as those performed by the AtGenExpress Consortium and various plant genome sequencing initiatives, have generated important and unprecedented large-scale data sets. While providing significant biological insights for the individual laboratories that generated them, these data sets, in conjunction with the appropriate tools, are also permitting plant biologists worldwide to gain new insights into their own biological systems of interest, often at a mouse click through a Web browser. This review provides an overview of several such genomic, epigenomic, transcriptomic, proteomic, and metabolomic data sets and describes Web-based tools for querying them in the context of hypothesis generation for plant biology. We provide five biological examples of how such tools and data sets have been used to provide biological insight.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Arabidopsis / genetics*
  • Computational Biology*
  • Databases, Genetic*
  • Epigenesis, Genetic
  • Gene Expression Profiling
  • Genomics
  • Internet*
  • Proteomics
  • Sequence Analysis, DNA
  • Sequence Analysis, Protein