One novel representation of DNA sequence based on the global and local position information

Sci Rep. 2018 May 15;8(1):7592. doi: 10.1038/s41598-018-26005-3.

Abstract

One novel representation of DNA sequence combining the global and local position information of the original sequence has been proposed to distinguish the different species. First, for the sufficient exploitation of global information, one graphical representation of DNA sequence has been formulated according to the curve of Fermat spiral. Then, for the consideration of local characteristics of DNA sequence, attaching each point in the curve of Fermat spiral with the related mass has been applied based on the relationships of neighboring four nucleotides. In this paper, the normalized moments of inertia of the curve of Fermat spiral which composed by the points with mass has been calculated as the numerical description of the corresponding DNA sequence on the first exons of beta-global genes. Choosing the Euclidean distance as the measurement of the numerical descriptions, the similarity between species has shown the performance of proposed method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Animals
  • Base Sequence*
  • Computational Biology / methods*
  • Exons
  • Humans
  • Sequence Analysis, DNA / methods
  • Species Specificity
  • beta-Globins / genetics*

Substances

  • beta-Globins