Relevant SARS-CoV-2 Genome Variation through Six Months of Worldwide Monitoring

Biomed Res Int. 2021 Jun 29:2021:5553173. doi: 10.1155/2021/5553173. eCollection 2021.

Abstract

Real-time genome monitoring of the SARS-CoV-2 pandemic outbreak is of utmost importance for designing diagnostic tools, guiding antiviral treatment and vaccination strategies. In this study, we present an accurate method for temporal and geographical comparison of mutational events based on GISAID database genome sequencing. Among 42523 SARS-CoV-2 genomes analyzed, we found 23202 variants compared to the reference genome. The Ti/Tv (transition/transversion) ratio was used to filter out possible false-positive errors. Transition mutations generally occurred more frequently than transversions. Our clustering analysis revealed remarkable hotspot mutation patterns for SARS-CoV-2. Mutations were clustered based on how their frequencies changed over time according to each geographical location. We observed some clusters showing a clear variation in mutation frequency and continuously evolving in the world. However, many mutations appeared in specific periods without a clear pattern over time. Various important nonsynonymous mutations were observed, mainly in Oceania and Asia. More than half of these mutations were observed only once. Four hotspot mutations were found in all geographical locations at least once: T265I (NSP2), P314L (NSP12), D614G (S), and Q57H (ORF3a). The current analysis of SARS-CoV-2 genomes provides valuable information on the geographical and temporal mutational evolution of SARS-CoV-2.

MeSH terms

  • COVID-19* / epidemiology
  • COVID-19* / genetics
  • Databases, Nucleic Acid*
  • Evolution, Molecular*
  • Genome, Viral*
  • Humans
  • Mutation*
  • Pandemics*
  • Phylogeny*
  • SARS-CoV-2 / genetics*