Structural investigation of pathogenic RFC1 AAGGG pentanucleotide repeats reveals a role of G-quadruplex in dysregulated gene expression in CANVAS

Nucleic Acids Res. 2024 Mar 21;52(5):2698-2710. doi: 10.1093/nar/gkae032.

Abstract

An expansion of AAGGG pentanucleotide repeats in the replication factor C subunit 1 (RFC1) gene is the genetic cause of cerebellar ataxia, neuropathy, and vestibular areflexia syndrome (CANVAS), and it also links to several other neurodegenerative diseases including the Parkinson's disease. However, the pathogenic mechanism of RFC1 AAGGG repeat expansion remains enigmatic. Here, we report that the pathogenic RFC1 AAGGG repeats form DNA and RNA parallel G-quadruplex (G4) structures that play a role in impairing biological processes. We determine the first high-resolution nuclear magnetic resonance (NMR) structure of a bimolecular parallel G4 formed by d(AAGGG)2AA and reveal how AAGGG repeats fold into a higher-order structure composed of three G-tetrad layers, and further demonstrate the formation of intramolecular G4s in longer DNA and RNA repeats. The pathogenic AAGGG repeats, but not the nonpathogenic AAAAG repeats, form G4 structures to stall DNA replication and reduce gene expression via impairing the translation process in a repeat-length-dependent manner. Our results provide an unprecedented structural basis for understanding the pathogenic mechanism of AAGGG repeat expansion associated with CANVAS. In addition, the high-resolution structures resolved in this study will facilitate rational design of small-molecule ligands and helicases targeting G4s formed by AAGGG repeats for therapeutic interventions.

MeSH terms

  • Cerebellar Ataxia*
  • DNA* / chemistry
  • DNA* / genetics
  • Gene Expression
  • Microsatellite Repeats*
  • Peripheral Nervous System Diseases*
  • RNA
  • Replication Protein C / genetics
  • Vestibular Diseases*

Substances

  • CANVAS syndrome
  • Replication Protein C
  • DNA
  • RNA