Severe acute respiratory syndrome Coronavirus-2 GenoAnalyzer and mutagenic anomaly detector using FCMFI and NSCE

Int J Biol Macromol. 2024 Feb;258(Pt 2):129051. doi: 10.1016/j.ijbiomac.2023.129051. Epub 2023 Dec 28.

Abstract

In order to deepen our understanding of the virus and help guide the creation of efficient therapies, this study uses artificial intelligence tools to thoroughly explore the genetic sequences of the SARS-CoV-2 virus. The process starts by using the Fuzzy Closure Miner for Frequent Itemsets (FCMFI) on a large corpus of SARS-CoV-2 genomic sequences to reveal hidden patterns, including nucleotides base sequences, repeating motifs, and corresponding interchanges. Then, using the Nucleotide Sequence Comprehension Engine (NSCE) technique, we were able to precisely define the genomic areas for mutation analysis. Structured and unstructured proteins are both strongly impacted by virus mutations, with spike proteins that are linked to the severity of COVID-19 pneumonia being particularly affected. Notably, the Mutagenic Anomaly Detector shows a 65 % efficiency boost in computing genome mutation rates compared to conventional point mutation analysis, while GenoAnalyzer offers a remarkable 93.33 % improvement over existing approaches in recognizing common genomic sequence patterns. These results highlight the potential of FCMFI to reveal complex genomic patterns and significant insights in COVID-19 genetic sequences when combined with mutation analysis. The Mutagenic Anomaly Detector and GenoAnalyzer show promise for revealing hidden genomic patterns and precisely estimating the SARS-CoV-2 mutation rate.

Keywords: FCMFI; Genome sequence; NSCE; Proteins; SARS-CoV-2.

MeSH terms

  • Artificial Intelligence
  • COVID-19*
  • Humans
  • Mutagens
  • Mutation
  • Phylogeny
  • SARS-CoV-2*

Substances

  • Mutagens