Potential G-quadruplexes and i-Motifs in the SARS-CoV-2

PLoS One. 2021 Jun 8;16(6):e0250654. doi: 10.1371/journal.pone.0250654. eCollection 2021.

Abstract

Quadruplex structures have been identified in a plethora of organisms where they play important functions in the regulation of molecular processes, and hence have been proposed as therapeutic targets for many diseases. In this paper we report the extensive bioinformatic analysis of the SARS-CoV-2 genome and related viruses using an upgraded version of the open-source algorithm G4-iM Grinder. This version improves the functionality of the software, including an easy way to determine the potential biological features affected by the candidates found. The quadruplex definitions of the algorithm were optimized for SARS-CoV-2. Using a lax quadruplex definition ruleset, which accepts amongst other parameters two residue G- and C-tracks, 512 potential quadruplex candidates were discovered. These sequences were evaluated by their in vitro formation probability, their position in the viral RNA, their uniqueness and their conservation rates (calculated in over seventeen thousand different COVID-19 clinical cases and sequenced at different times and locations during the ongoing pandemic). These results were then compared subsequently to other Coronaviridae members, other Group IV (+)ssRNA viruses and the entire viral realm. Sequences found in common with other viral species were further analyzed and characterized. Sequences with high scores unique to the SARS-CoV-2 were studied to investigate the variations amongst similar species. Quadruplex formation of the best candidates were then confirmed experimentally. Using NMR and CD spectroscopy, we found several highly stable RNA quadruplexes that may be suitable therapeutic targets for the SARS-CoV-2.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology
  • G-Quadruplexes*
  • Genome, Viral*
  • Guanine
  • Nucleotide Motifs*
  • RNA, Viral / genetics*
  • SARS-CoV-2 / genetics*

Substances

  • RNA, Viral
  • Guanine

Grants and funding

M.B-L & J.G: a. Grants: 1. NORTE-01-0145-FEDER-000019, 2. NORTE-01-0145-FEDER-031142 and 3. 0624_2IQBIONEURO_6_E. b. Funders: 1. 2014-2020 North Portugal Regional Operational Program (NORTE 2020) and the European Regional Development Fund (ERDF), 2. the Fundação para a Ciência e a Tecnoloxía (FCT), ERDF and NORTE 2020, 3. 2014-2020 INTERREG Cooperation Programme Spain–Portugal (POCTEP). c. URLS: 1. https://norte2020.pt, https://ec.europa.eu/regional_policy/en/funding/erdf/, 2. https://www.fct.pt, https://norte2020.pt, https://ec.europa.eu/regional_policy/en/funding/erdf/, 3. https://interreg.eu/programme/interreg-spain-portugal-poctep/ The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. C.G: a. Grants: BFU2017-89707-P b. Funders: Spanish Ministry of Science, Innovation and Universities (MCIU) c. URL: https://www.ciencia.gob.es d. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.