Direct genome-wide identification of G-quadruplex structures by whole-genome resequencing

Nat Commun. 2021 Oct 14;12(1):6014. doi: 10.1038/s41467-021-26312-w.

Abstract

We present a user-friendly and transferable genome-wide DNA G-quadruplex (G4) profiling method that identifies G4 structures from ordinary whole-genome resequencing data by seizing the slight fluctuation of sequencing quality. In the human genome, 736,689 G4 structures were identified, of which 45.9% of all predicted canonical G4-forming sequences were characterized. Over 89% of the detected canonical G4s were also identified by combining polymerase stop assays with next-generation sequencing. Testing using public datasets of 6 species demonstrated that the present method is widely applicable. The detection rates of predicted canonical quadruplexes ranged from 32% to 58%. Because single nucleotide variations (SNVs) influence the formation of G4 structures and have individual differences, the given method is available to identify and characterize G4s genome-wide for specific individuals.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA
  • G-Quadruplexes*
  • Genetic Techniques*
  • Genome, Human
  • High-Throughput Nucleotide Sequencing / methods*
  • High-Throughput Screening Assays / methods
  • Humans
  • Precision Medicine
  • Whole Genome Sequencing / methods*

Substances

  • DNA