Sequence periodic pattern of HERV LTRs: a matrix simulation algorithm

J Biosci. 2012 Mar;37(1):19-24. doi: 10.1007/s12038-012-9182-x.

Abstract

Flanking regulatory long terminal repeats (LTRs) in Human endogenous retrovirus (HERV) is a kind of typical DNA repeat that is widespread in the human genome. Currently, many algorithms have been developed to detect the latent periodicity of a wide range of DNA repeats. However, no such attempt was made for HERV LTRs. The present study focused on the investigation of the possible sequence periodic patterns in the HERV LTRs and their regulatory mechanisms. We calculated the sequence periods of 5', 3' and combined LTRs in HERVs with our devised matrix simulation algorithm. It is interesting that 5' and 3' LTRs have the same period of 7, and combined LTRs have a period of 9. These results indicated that HERV LTRs have predominant periodic patterns. Based on the obtained sequence periodicity, we constructed periodic consensus sequences of 5', 3' and combined LTRs. As to 5' and 3' LTRs with the same period - 7, we manually scanned the nucleotide bases in the corresponding positions of their periodic consensus sequences, and found some positions have the nucleotide base unchanged, such as the 1st, 5th and 7th positions. These conservative nucleotide base positions represent critical binding sites of regulatory LTRs, and may be indicative of conserved regulatory mechanisms in LRT-participating regulatory networks.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Base Sequence / genetics*
  • Endogenous Retroviruses / genetics*
  • Genome Components / genetics*
  • Sequence Analysis, DNA / methods*
  • Terminal Repeat Sequences / genetics*