Methods and strategies for gene structure curation in WormBase

Database (Oxford). 2011 May 3:2011:baq039. doi: 10.1093/database/baq039. Print 2011.

Abstract

The Caenorhabditis elegans genome sequence was published over a decade ago; this was the first published genome of a multi-cellular organism and now the WormBase project has had a decade of experience in curating this genome's sequence and gene structures. In one of its roles as a central repository for nematode biology, WormBase continues to refine the gene structure annotations using sequence similarity and other computational methods, as well as information from the literature- and community-submitted annotations. We describe the various methods of gene structure curation that have been tried by WormBase and the problems associated with each of them. We also describe the current strategy for gene structure curation, and introduce the WormBase 'curation tool', which integrates different data sources in order to identify new and correct gene structures. Database URL: http://www.wormbase.org/.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Base Sequence
  • Caenorhabditis elegans / genetics*
  • Computational Biology / methods*
  • DNA, Intergenic / genetics
  • Databases, Genetic*
  • Genes, Helminth / genetics*
  • High-Throughput Nucleotide Sequencing
  • Molecular Sequence Annotation / methods*
  • Open Reading Frames / genetics

Substances

  • DNA, Intergenic