Topological Analysis for Sequence Variability: Case Study on more than 2K SARS-CoV-2 sequences of COVID-19 infected 54 countries in comparison with SARS-CoV-1 and MERS-CoV

Infect Genet Evol. 2021 Mar:88:104708. doi: 10.1016/j.meegid.2021.104708. Epub 2021 Jan 6.

Abstract

The pandemic due to novel coronavirus, SARS-CoV-2 is a serious global concern now. More than thousand new COVID-19 infections are getting reported daily for this virus across the globe. Thus, the medical research communities are trying to find the remedy to restrict the spreading of this virus, while the vaccine development work is still under research in parallel. In such critical situation, not only the medical research community, but also the scientists in different fields like microbiology, pharmacy, bioinformatics and data science are also sharing effort to accelerate the process of vaccine development, virus prediction, forecasting the transmissible probability and reproduction cases of virus for social awareness. With the similar context, in this article, we have studied sequence variability of the virus primarily focusing on three aspects: (a) sequence variability among SARS-CoV-1, MERS-CoV and SARS-CoV-2 in human host, which are in the same coronavirus family, (b) sequence variability of SARS-CoV-2 in human host for 54 different countries and (c) sequence variability between coronavirus family and country specific SARS-CoV-2 sequences in human host. For this purpose, as a case study, we have performed topological analysis of 2391 global genomic sequences of SARS-CoV-2 in association with SARS-CoV-1 and MERS-CoV using an integrated semi-alignment based computational technique. The results of the semi-alignment based technique are experimentally and statistically found similar to alignment based technique and computationally faster. Moreover, the outcome of this analysis can help to identify the nations with homogeneous SARS-CoV-2 sequences, so that same vaccine can be applied to their heterogeneous human population.

Keywords: COVID-19; Coronavirsus; Genomic sequences; Global health; MERS-CoV; SARS-CoV-2; Sequence alignment free technique.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Africa / epidemiology
  • Americas / epidemiology
  • Asia / epidemiology
  • Australia / epidemiology
  • Base Sequence
  • COVID-19 / epidemiology*
  • COVID-19 / transmission
  • COVID-19 / virology
  • Computational Biology / methods
  • Coronavirus Infections / epidemiology*
  • Coronavirus Infections / transmission
  • Coronavirus Infections / virology
  • Europe / epidemiology
  • Genetic Variation*
  • Genome, Viral*
  • Host-Pathogen Interactions / genetics
  • Humans
  • Middle East Respiratory Syndrome Coronavirus / genetics
  • Middle East Respiratory Syndrome Coronavirus / pathogenicity
  • Pandemics*
  • SARS-CoV-2 / genetics*
  • SARS-CoV-2 / pathogenicity
  • Sequence Alignment
  • Severe Acute Respiratory Syndrome / epidemiology*
  • Severe Acute Respiratory Syndrome / transmission
  • Severe Acute Respiratory Syndrome / virology
  • Severe acute respiratory syndrome-related coronavirus / genetics
  • Severe acute respiratory syndrome-related coronavirus / pathogenicity