A comprehensive analysis of genome composition and codon usage patterns of emerging coronaviruses

Virus Res. 2020 Jul 2:283:197976. doi: 10.1016/j.virusres.2020.197976. Epub 2020 Apr 12.

Abstract

An outbreak of atypical pneumonia caused by a novel Betacoronavirus (βCoV), named SARS-CoV-2 has been declared a public health emergency of international concern by the World Health Organization. In order to gain insight into the emergence, evolution and adaptation of SARS-CoV-2 viruses, a comprehensive analysis of genome composition and codon usage of βCoV circulating in China was performed. A biased nucleotide composition was found for SARS-CoV-2 genome. This bias in genomic composition is reflected in its codon and amino acid usage patterns. The overall codon usage in SARS-CoV-2 is similar among themselves and slightly biased. Most of the highly frequent codons are A- and U-ending, which strongly suggests that mutational bias is the main force shaping codon usage in this virus. Significant differences in relative synonymous codon usage frequencies among SARS-CoV-2 and human cells were found. These differences are due to codon usage preferences.

Keywords: 2019-nCoV; Codon usage; Coronavirus; Evolution; SARS-CoV-2; Wuhan.

Publication types

  • Comparative Study

MeSH terms

  • Amino Acids / genetics
  • Animals
  • Betacoronavirus / classification*
  • Betacoronavirus / genetics*
  • Betacoronavirus / isolation & purification
  • China / epidemiology
  • Chiroptera / virology
  • Codon Usage / genetics*
  • Communicable Diseases, Emerging / virology*
  • Coronavirus Infections / epidemiology
  • Coronavirus Infections / veterinary
  • Coronavirus Infections / virology
  • Evolution, Molecular
  • Ferrets / virology
  • Gene Expression Regulation, Viral / genetics*
  • Genome, Viral / genetics*
  • Genomics*
  • Humans
  • Mutagenesis / genetics
  • Open Reading Frames / genetics
  • SARS-CoV-2
  • Viverridae / virology

Substances

  • Amino Acids