On the analysis of large-scale genomic structures

Cell Biochem Biophys. 2005;42(2):145-65. doi: 10.1385/CBB:42:2:145.

Abstract

We apply methods from statistical physics (histograms, correlation functions, fractal dimensions, and singularity spectra) to characterize large-scale structure of the distribution of nucleotides along genomic sequences. We discuss the role of the extension of noncoding segments ("junk DNA") for the genomic organization, and the connection between the coding segment distribution and the high-eukaryotic chromatin condensation. The following sequences taken from GenBank were analyzed: complete genome of Xanthomonas campestri, complete genome of yeast, chromosome V of Caenorhabditis elegans, and human chromosome XVII around gene BRCA1. The results are compared with the random and periodic sequences and those generated by simple and generalized fractal Cantor sets.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Animals
  • Base Sequence
  • Chromosome Mapping / methods*
  • Fractals*
  • Humans
  • Models, Genetic*
  • Models, Statistical*
  • Molecular Sequence Data
  • Open Reading Frames / genetics*
  • Sequence Analysis, DNA / methods*
  • Statistical Distributions