G-Quadruplexes Involving Both Strands of Genomic DNA Are Highly Abundant and Colocalize with Functional Sites in the Human Genome

PLoS One. 2016 Jan 4;11(1):e0146174. doi: 10.1371/journal.pone.0146174. eCollection 2016.

Abstract

The G-quadruplex (G4) is a non-canonical DNA structure that is biologically significant in DNA replication, transcription and telomere stability. To date, only G4s with all guanines originating from the same strand of DNA have been considered in the context of the human nuclear genome. Here, I discuss interstrand topological configurations of G-quadruplex DNA, consisting of guanines from both strands of genomic DNA, and present an algorithm for predicting such structures. I have identified over 550,000 non-overlapping interstrand G-quadruplex-forming sequences in the human genome, significantly more than intrastrand configurations. Functional analysis of interstrand G-quadruplex sites shows a strong association with transcription initiation; the results are consistent with the XPB and XPD transcriptional helicases binding only to G-quadruplex DNA with interstrand topology. Interstrand quadruplexes are also enriched in origin of replication sites. Several topology classes of interstrand quadruplex-forming sequences are possible, and different topologies are enriched in different types of structural elements. The list of interstrand quadruplex-forming sequences and the computer program used for their prediction are available at http://moment.utmb.edu/allquads.
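
The abstract does not describe the prediction algorithm itself, so the following is only a rough illustration of what an interstrand scan could look like, not the author's program. It assumes the conventional quadruplex motif (four tracts of at least three guanines separated by loops of 1 to 7 nucleotides) and treats a run of C on the given strand as a run of G on the complementary strand. The function name, parameters, and the "+"/"-" topology labels are hypothetical.

```python
import re

# A tract is a run of >=3 G (guanines on the given strand) or
# >=3 C (guanines on the complementary strand). Assumed motif, not from the paper.
TRACT = re.compile(r"G{3,}|C{3,}")


def find_interstrand_candidates(seq, min_loop=1, max_loop=7):
    """Return non-overlapping (start, end, pattern) candidates.

    pattern marks each of the four tracts as '+' (G-run, given strand)
    or '-' (C-run, i.e. guanines on the complementary strand).
    Only windows that mix both strands are reported.
    """
    seq = seq.upper()
    runs = [(m.start(), m.end(), m.group()[0]) for m in TRACT.finditer(seq)]
    hits = []
    i = 0
    while i + 3 < len(runs):
        window = runs[i:i + 4]
        # Loop length = gap between the end of one tract and the start of the next.
        loops = [window[k + 1][0] - window[k][1] for k in range(3)]
        strands = {run[2] for run in window}
        if all(min_loop <= n <= max_loop for n in loops) and strands == {"G", "C"}:
            pattern = "".join("+" if run[2] == "G" else "-" for run in window)
            hits.append((window[0][0], window[3][1], pattern))
            i += 4  # skip past this hit so that results do not overlap
        else:
            i += 1
    return hits


if __name__ == "__main__":
    print(find_interstrand_candidates("GGGTTCCCATGGGTACCC"))
```

In this simplified sketch each maximal run counts as a single tract, and a purely intrastrand motif such as GGGTTGGGTTGGGTTGGG is deliberately not reported, since all four tracts lie on one strand.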

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • DNA / chemistry
  • DNA, Single-Stranded / chemistry
  • G-Quadruplexes*
  • Genome, Human*
  • Guanine / chemistry
  • Humans
  • Transcription Initiation, Genetic

Substances

  • DNA, Single-Stranded
  • Guanine
  • DNA