Mutational Asymmetries in the SARS-CoV-2 Genome May Lead to Increased Hydrophobicity of Virus Proteins

Genes (Basel). 2021 May 27;12(6):826. doi: 10.3390/genes12060826.

Abstract

The genomic diversity of SARS-CoV-2 has been a focus during the ongoing COVID-19 pandemic. Here, we analyzed the distribution and character of emerging mutations in a data set comprising more than 95,000 virus genomes covering eight major SARS-CoV-2 lineages in the GISAID database, including genotypes arising during COVID-19 therapy. Globally, the C>U transitions and G>U transversions were the most represented mutations, accounting for the majority of single-nucleotide variations. Mutational spectra were not influenced by the time the virus had been circulating in its host or medical treatment. At the amino acid level, we observed about a 2-fold excess of substitutions in favor of hydrophobic amino acids over the reverse. However, most mutations constituting variants of interests of the S-protein (spike) lead to hydrophilic amino acids, counteracting the global trend. The C>U and G>U substitutions altered codons towards increased amino acid hydrophobicity values in more than 80% of cases. The bias is explained by the existing differences in the codon composition for amino acids bearing contrasting biochemical properties. Mutation asymmetries apparently influence the biochemical features of SARS CoV-2 proteins, which may impact protein-protein interactions, fusion of viral and cellular membranes, and virion assembly.

Keywords: SARS-CoV-2; amino acid hydrophobicity; apolipoprotein B mRNA editing enzyme (APOBEC); coronavirus; evolution; genetic variation; mutability.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • APOBEC Deaminases
  • Alleles
  • Amino Acid Substitution
  • Amino Acids / chemistry
  • Amino Acids / genetics
  • COVID-19 / virology*
  • Evolution, Molecular
  • Genetic Variation
  • Genome, Viral*
  • Genotype
  • Host-Pathogen Interactions
  • Humans
  • Hydrophobic and Hydrophilic Interactions*
  • Mutation*
  • Phylogeny
  • Polymorphism, Single Nucleotide
  • Protein Binding
  • Protein Interaction Domains and Motifs
  • SARS-CoV-2 / genetics*
  • Spike Glycoprotein, Coronavirus / chemistry
  • Spike Glycoprotein, Coronavirus / genetics
  • Viral Proteins / chemistry*
  • Viral Proteins / genetics*

Substances

  • Amino Acids
  • Spike Glycoprotein, Coronavirus
  • Viral Proteins
  • APOBEC Deaminases