DECODING COMPLEXITY IN BIOMOLECULAR RECOGNITION OF DNA I-MOTIFS

bioRxiv [Preprint]. 2023 Apr 21:2023.04.19.537548. doi: 10.1101/2023.04.19.537548.

Abstract

DNA i-motifs (iMs) are non-canonical C-rich secondary structures implicated in numerous cellular processes. Though iMs exist throughout the genome, our understanding of iM recognition by proteins or small molecules is limited to a few examples. We designed a DNA microarray containing 10,976 genomic iM sequences to examine the binding profiles of four iM-binding proteins, mitoxantrone, and the iMab antibody. iMab microarray screens demonstrated that pH 6.5, 5% BSA buffer was optimal, and fluorescence was correlated with iM C-tract length. hnRNP K broadly recognizes diverse iM sequences, favoring 3-5 cytosine repeats flanked by thymine-rich loops of 1-3 nucleotides. Array binding mirrored public ChIP-Seq datasets, in which 35% of well-bound array iMs are enriched in hnRNP K peaks. In contrast, other reported iM-binding proteins had weaker binding or preferred G-quadruplex (G4) sequences instead. Mitoxantrone broadly binds both shorter iMs and G4s, consistent with an intercalation mechanism. These results suggest that hnRNP K may play a role in iM-mediated regulation of gene expression in vivo, whereas hnRNP A1 and ASF/SF2 are possibly more selective in their binding preferences. This powerful approach represents the most comprehensive investigation of how biomolecules selectively recognize genomic iMs to date.

Keywords: DNA recognition; DNA structures; fluorescent probes; high-throughput screening; i-motif.

Publication types

  • Preprint