Relating SARS-CoV-2 variants using cellular automata imaging

Sci Rep. 2022 Jun 18;12(1):10297. doi: 10.1038/s41598-022-14404-6.

Abstract

We classify the main variants of the SARS-CoV-2 virus representing a given biological sequence coded as a symbolic digital sequence and by its evolution by a cellular automata with a properly chosen rule. The spike protein, common to all variants of the SARS-CoV-2 virus, is then by the picture of the cellular automaton evolution yielding a visible representation of important features of the protein. We use information theory Hamming distance between different stages of the evolution of the cellular automaton for seven variants relative to the original Wuhan/China virus. We show that our approach allows to classify and group variants with common ancestors and same mutations. Although being a simpler method, it can be used as an alternative for building phylogenetic trees.

MeSH terms

  • COVID-19* / genetics
  • Cellular Automata
  • Genome, Viral
  • Humans
  • Mutation
  • Phylogeny
  • SARS-CoV-2* / genetics

Supplementary concepts

  • SARS-CoV-2 variants