Quasi-species nature and differential gene expression of severe acute respiratory syndrome coronavirus 2 and phylogenetic analysis of a novel Iranian strain

Infect Genet Evol. 2020 Nov:85:104556. doi: 10.1016/j.meegid.2020.104556. Epub 2020 Sep 13.

Abstract

A novel coronavirus related to severe acute respiratory syndrome virus, (SARS-CoV-2) is the causal agent of the COVID-19 pandemic. Despite the genetic mutations across the SARS-CoV-2 genome being recently investigated, its transcriptomic genetic polymorphisms at inter-host level and the viral gene expression level based on each Open Reading Frame (ORF) remains unclear. Using available High Throughput Sequencing (HTS) data and based on SARS-CoV-2 infected human transcriptomic data, this study presents a high-resolution map of SARS-CoV-2 single nucleotide polymorphism (SNP) hotspots in a viral population at inter-host level. Four throat swab samples from COVID-19 infected patients were pooled, with RNA-Seq read retrieved from SRA NCBI to detect 21 SNPs and a replacement across the SARS-CoV-2 genomic population. Twenty-two RNA modification sites on viral transcripts were identified that may cause inter-host genetic diversity of this virus. In addition, the canonical genomic RNAs of N ORF showed higher expression in transcriptomic data and reverse transcriptase quantitative PCR compared to other SARS-CoV-2 ORFs, indicating the importance of this ORF in virus replication or other major functions in virus cycle. Phylogenetic and ancestral sequence analyses based on the entire genome revealed that SARS-CoV-2 is possibly derived from a recombination event between SARS-CoV and Bat SARS-like CoV. Ancestor analysis of the isolates from different locations including Iran suggest shared Chinese ancestry. These results propose the importance of potential inter-host level genetic variations to the evolution of SARS-COV-2, and the formation of viral quasi-species. The RNA modifications discovered in this study may cause amino acid sequence changes in polyprotein, spike protein, product of ORF8 and nucleocapsid (N) protein, suggesting further insights to understanding the functional impacts of mutations in the life cycle and pathogenicity of SARS-CoV-2.

Keywords: Inter-host; Nucleocapsid; Open reading frame; Phylogenetic; SARS-CoV-2; Single nucleotide polymorphism.

MeSH terms

  • COVID-19 / genetics
  • COVID-19 / virology*
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation, Viral
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Iran
  • Pharynx / virology
  • Phylogeny
  • Polymorphism, Single Nucleotide*
  • Quasispecies
  • SARS-CoV-2 / classification*
  • SARS-CoV-2 / genetics
  • Sequence Analysis, RNA
  • Viral Proteins / genetics*
  • Virus Replication

Substances

  • Viral Proteins