Characterizing Transcriptional Regulatory Sequences in Coronaviruses and Their Role in Recombination

Mol Biol Evol. 2021 Apr 13;38(4):1241-1248. doi: 10.1093/molbev/msaa281.

Abstract

Novel coronaviruses, including SARS-CoV-2, SARS, and MERS, often originate from recombination events. The mechanism of recombination in RNA viruses is template switching. Coronavirus transcription also involves template switching at specific regions, called transcriptional regulatory sequences (TRS). It is hypothesized but not yet verified that TRS sites are prone to recombination events. Here, we developed a tool called SuPER to systematically identify TRS in coronavirus genomes and then investigated whether recombination is more common at TRS. We ran SuPER on 506 coronavirus genomes and identified 465 TRS-L and 3,509 TRS-B. We found that the TRS-L core sequence (CS) and the secondary structure of the leader sequence are generally conserved within coronavirus genera but different between genera. By examining the location of recombination breakpoints with respect to TRS-B CS, we observed that recombination hotspots are more frequently colocated with TRS-B sites than expected.

Keywords: Key words: coronavirus; recombination; transcriptional regulatory sequences (TRS).

Publication types

  • Research Support, N.I.H., Intramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Coronavirus / genetics*
  • Genome, Viral*
  • Models, Genetic
  • Nucleic Acid Conformation
  • Phylogeny
  • RNA, Messenger / genetics
  • RNA, Viral / genetics*
  • RNA-Seq
  • Recombination, Genetic*
  • Regulatory Sequences, Nucleic Acid*
  • SARS-CoV-2 / genetics*
  • Transcription, Genetic

Substances

  • RNA, Messenger
  • RNA, Viral