i-Genome: a database to summarize oligonucleotide data in genomes

BMC Genomics. 2004 Oct 9:5:78. doi: 10.1186/1471-2164-5-78.

Abstract

Background: Information on the occurrence of sequence features in genomes is crucial to comparative genomics, evolutionary analysis, the analyses of regulatory sequences and the quantitative evaluation of sequences. Computing the frequencies and the occurrences of a pattern in complete genomes is time-consuming.

Results: The proposed database provides information about sequence features generated by exhaustively computing the sequences of the complete genome. The repetitive elements in the eukaryotic genomes, such as LINEs, SINEs, Alu and LTR, are obtained from Repbase. The database supports various complete genomes including human, yeast, worm, and 128 microbial genomes.

Conclusions: This investigation presents and implements an efficiently computational approach to accumulate the occurrences of the oligonucleotides or patterns in complete genomes. A database is established to maintain the information of the sequence features, including the distributions of oligonucleotide, the gene distribution, the distribution of repetitive elements in genomes and the occurrences of the oligonucleotides. The database can provide more effective and efficient way to access the repetitive features in genomes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alu Elements / genetics
  • Animals
  • Caenorhabditis elegans / genetics
  • Databases, Genetic*
  • Genome*
  • Genome, Bacterial
  • Genome, Fungal
  • Genome, Human
  • Humans
  • Long Interspersed Nucleotide Elements / genetics
  • Oligonucleotides / genetics*
  • Saccharomyces cerevisiae / genetics
  • Short Interspersed Nucleotide Elements / genetics
  • Terminal Repeat Sequences / genetics

Substances

  • Oligonucleotides