MaXML: mouse annotation XML

In Silico Biol. 2004;4(1):7-15. Epub 2003 Dec 26.

Abstract

Although the sequencing of the human genome and several model organisms is almost complete, the number of genes in the human is still in debate. cDNA (complementary DNA) is generated from mRNA that is transcribed from the genome and can be regarded as a gene itself; therefore, decoding cDNA sequences is important in characterizing genes. Recently, biologists have been able to describe more knowledge about genes in order to characterize them, and this information is generally called 'annotation.' Furthermore, annotation is important in understanding the systems of organisms in various fields of research. We therefore constructed the MaXML (Mouse annotation XML) format with which mouse cDNA annotation data can be exchanged and shared between laboratories more efficiently. Defining strict data types for annotations is difficult, but we consider XML a feasible format for describing them. We have used the MaXML format to express mouse annotation data in FANTOM DB. We have also developed tools and systems utilizing these MaXML data, including a parser and a server to provide data on-the-fly.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Computational Biology*
  • DNA, Complementary / genetics*
  • Databases, Genetic*
  • Genome
  • Mice / genetics*

Substances

  • DNA, Complementary