Latent periodicity-2 in coronavirus SARS-CoV-2 genome: Evolutionary implications

J Theor Biol. 2021 Apr 21:515:110604. doi: 10.1016/j.jtbi.2021.110604. Epub 2021 Jan 26.

Abstract

The ongoing global pandemic of the infectious disease COVID-19, caused by the 2019 novel coronavirus (SARS-CoV-2, formerly 2019-nCoV), presents critical threats to public health and the economy. The genome of SARS-CoV-2 has been sequenced and structurally annotated, yet little is known about the intrinsic organization and evolution of the genome. To this end, we present a mathematical method for constructing the genomic spectrum, a kind of barcode, of SARS-CoV-2 and common human coronaviruses. The genomic spectrum is constructed according to the periodic distributions of nucleotides and therefore reflects the unique characteristics of the genome. The results demonstrate that SARS-CoV-2 exhibits predominant latent periodicity-2 in the regions encoding non-structural proteins 3, 4, 5, and 6. Further analysis of the latent periodicity-2 regions suggests that dinucleotide imbalances have increased during evolution and may contribute to the evolutionary fitness of the virus. In particular, SARS-CoV-2 isolates have shown increased latent periodicity-2 and periodicity-3 during the COVID-19 pandemic. The distinctively strong periodicity-2 regions and the intensity of periodicity-2 across the whole SARS-CoV-2 genome may serve as diagnostic and pharmaceutical targets for monitoring and treating COVID-19.
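
The abstract does not spell out how the genomic spectrum is computed. As a rough illustration of what "periodicity-2" in a nucleotide sequence can mean, the sketch below uses a generic Fourier approach on per-base indicator sequences. It is not the authors' method. The function names `periodicity_power` and `normalized_power`, and the choice to normalize by sequence length, are assumptions made only for this example.

```python
import cmath


def periodicity_power(seq, period):
    """Fourier power of a nucleotide sequence at one period, summed over A, C, G, T.

    Illustrative sketch only: for each base, occurrences are counted by
    position class (index mod period), and the Fourier coefficient at
    frequency 1/period is computed from those counts.
    """
    seq = seq.upper().replace("U", "T")  # treat RNA and DNA alphabets alike
    total = 0.0
    for base in "ACGT":
        # counts[k] = how often this base appears at positions with n % period == k
        counts = [0] * period
        for n, ch in enumerate(seq):
            if ch == base:
                counts[n % period] += 1
        # Fourier coefficient of the base's indicator sequence at frequency 1/period
        coeff = sum(
            c * cmath.exp(-2j * cmath.pi * k / period)
            for k, c in enumerate(counts)
        )
        total += abs(coeff) ** 2
    return total


def normalized_power(seq, period):
    """Power divided by sequence length (a hypothetical normalization)."""
    return periodicity_power(seq, period) / len(seq) if seq else 0.0


if __name__ == "__main__":
    for example in ("ATATATAT", "AATTAATT", "ATGATGATG"):
        print(example, normalized_power(example, 2), normalized_power(example, 3))
```

For period 2 the coefficient of each base reduces to the difference between its counts at even and odd positions, so a strictly alternating sequence such as `ATATATAT` gives a large value, while `AATTAATT`, in which every base is split evenly between even and odd positions, gives a value near zero. Applying such a measure in sliding windows along a genome would be one way to look for regions with strong period-2 signal, again only as an assumption about how such an analysis might be set up.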

Keywords: 2019-nCoV; COVID-19; Dinucleotide; Evolution fitness; Periodicity; SARS-CoV-2; Virulence.

Publication types

  • Historical Article

MeSH terms

  • Base Sequence
  • COVID-19 / epidemiology
  • COVID-19 / virology
  • DNA Barcoding, Taxonomic / methods
  • Evolution, Molecular*
  • Genome, Viral* / genetics
  • Genomics
  • History, 21st Century
  • Humans
  • Models, Theoretical*
  • Open Reading Frames / genetics
  • Pandemics
  • Period Circadian Proteins / genetics*
  • Phylogeny
  • RNA, Viral / genetics
  • SARS-CoV-2 / genetics*
  • SARS-CoV-2 / pathogenicity
  • Sequence Analysis, DNA
  • Virulence / genetics*

Substances

  • Period Circadian Proteins
  • RNA, Viral