Representing GC variation along eukaryotic chromosomes

Gene. 2004 May 26:333:135-41. doi: 10.1016/j.gene.2004.02.041.

Abstract

Genome sequencing now permits direct visual representation, at any scale, of GC heterogeneity along the chromosomes of several higher eukaryotes. Plots can be easily obtained from the chromosomal sequences, yet sequence releases of mammalian or plant chromosomes still tend to use small scales or window sizes that obscure important large-scale compositional features. To faithfully reveal, at a glance, the compositional variation at a given scale, we have devised a simple scheme that combines line plots with color-coded shading of the regions underneath the plots. The scheme can be applied to different eukaryotic genomes to facilitate their comparison, as illustrated here for a sample of chromosomes chosen from seven selected species. As a complement to a previously published compact view of isochores in the human genome sequence, we include here an analogous map for the recently sequenced mouse genome, and discuss the contribution of repetitive DNA to the GC variation along the plots. Supplementary information, including a database of color-coded GC profiles for all recently sequenced eukaryotes and the program draw_chromosomes_gc.pl used to obtain them, is available at.
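The idea of a windowed GC profile with color-coded classes can be illustrated with a minimal Python sketch. This is not the authors' draw_chromosomes_gc.pl program. The function names, the use of non-overlapping windows, and the class boundaries in GC_CLASSES are all assumptions chosen for illustration; the abstract does not state the window sizes or color thresholds actually used.

```python
from typing import List, Tuple

# Hypothetical class boundaries: (upper GC fraction, color name).
# These are illustrative placeholders, NOT values taken from the paper.
GC_CLASSES = [
    (0.35, "blue"),
    (0.40, "cyan"),
    (0.45, "yellow"),
    (0.50, "orange"),
    (1.01, "red"),
]


def gc_windows(seq: str, window: int) -> List[Tuple[int, float]]:
    """Return (start, GC fraction) for each full, non-overlapping window.

    Ambiguous bases such as N are ignored; windows containing no
    A/C/G/T at all are skipped.
    """
    seq = seq.upper()
    profile = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        acgt = sum(chunk.count(base) for base in "ACGT")
        if acgt == 0:
            continue  # nothing to measure in this window
        gc = (chunk.count("G") + chunk.count("C")) / acgt
        profile.append((start, gc))
    return profile


def color_for(gc: float) -> str:
    """Map a GC fraction to the first class whose upper bound exceeds it."""
    for upper, color in GC_CLASSES:
        if gc < upper:
            return color
    return GC_CLASSES[-1][1]


def color_coded_profile(seq: str, window: int) -> List[Tuple[int, float, str]]:
    """Attach a color class to each window of the GC profile."""
    return [(start, gc, color_for(gc)) for start, gc in gc_windows(seq, window)]
```

Each returned tuple gives a window start, its GC fraction, and a color class. A plotting library could then draw the GC values as a line and shade the area under each window with its color, and a larger window value would bring out larger-scale compositional features.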

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Anopheles / genetics
  • Arabidopsis / genetics
  • Base Composition / genetics*
  • Caenorhabditis elegans / genetics
  • Chromosomes / genetics*
  • DNA / genetics
  • Drosophila melanogaster / genetics
  • Eukaryotic Cells / metabolism*
  • Genome
  • Humans
  • Isochores / genetics
  • Mice
  • Repetitive Sequences, Nucleic Acid / genetics
  • Saccharomyces cerevisiae / genetics
  • Software

Substances

  • Isochores
  • DNA