Nucleotide spacing distribution analysis for human genome

Mamm Genome. 2021 Apr;32(2):123-128. doi: 10.1007/s00335-021-09865-5. Epub 2021 Mar 15.

Abstract

The distribution of nucleotide spacing in the human genome was investigated. The frequency of occurrence in the human genome of sequences of different lengths flanked by one type of nucleotide was analyzed, showing that the distribution has no self-similar (fractal) structure. The results nevertheless revealed several characteristic features: (i) the distribution for short-range spacing is quite similar to that of purely stochastic sequences; (ii) the distribution for long-range spacing deviates substantially from the random sequence distribution, showing strong long-range correlations; (iii) the differences between (A, T) and (C, G) nucleotides are quite significant; (iv) the spacing distribution displays tiny oscillations.
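
The kind of measurement described above can be illustrated with a minimal sketch. It is not the authors' code. It assumes that "spacing" means the number of bases lying between two consecutive occurrences of the same nucleotide, and that the "purely stochastic" reference is an independent, identically distributed sequence, for which a gap of length k has probability p(1 - p)^k when the nucleotide occurs with frequency p. The function names `spacing_counts` and `geometric_probability` are hypothetical.

```python
from collections import Counter


def spacing_counts(seq, base):
    """Count gap lengths between consecutive occurrences of `base`.

    Assumption: a gap is the number of other bases strictly between
    two neighbouring occurrences, so adjacent occurrences give gap 0.
    """
    counts = Counter()
    prev = None  # index of the previous occurrence of `base`
    for i, ch in enumerate(seq.upper()):
        if ch == base:
            if prev is not None:
                counts[i - prev - 1] += 1
            prev = i
    return counts


def geometric_probability(p, k):
    """Probability of a gap of length k in an i.i.d. random sequence
    where `base` occurs with probability p (assumed null model)."""
    return p * (1 - p) ** k


# Toy example: A occurs at positions 0, 4, 5 and 8.
observed = spacing_counts("ACGTAACCA", "A")
total = sum(observed.values())
for k in sorted(observed):
    print(k, observed[k] / total, geometric_probability(0.25, k))
```

On a real chromosome, the observed frequencies would be compared with the null model over a wide range of k; per the abstract, agreement is expected at short range and deviation at long range.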

MeSH terms

  • Algorithms
  • Base Composition*
  • Genome, Human*
  • Genomics* / methods
  • Humans
  • Models, Theoretical
  • Nucleotides*

Substances

  • Nucleotides