Sars-escape network for escape prediction of SARS-COV-2

Brief Bioinform. 2023 May 19;24(3):bbad140. doi: 10.1093/bib/bbad140.

Abstract

Motivation: Viruses have coevolved with their hosts for over millions of years and learned to escape the host's immune system. Although not all genetic changes in viruses are deleterious, some significant mutations lead to the escape of neutralizing antibodies and weaken the immune system, which increases infectivity and transmissibility, thereby impeding the development of antiviral drugs or vaccines. Accurate and reliable identification of viral escape mutational sequences could be a good indicator for therapeutic design. We developed a computational model that recognizes significant mutational sequences based on escape feature identification using natural language processing along with prior knowledge of experimentally validated escape mutants.

Results: Our machine learning-based computational approach can recognize the significant spike protein sequences of severe acute respiratory syndrome coronavirus 2 using sequence data alone. This modelling approach can be applied to other viruses, such as influenza, monkeypox and HIV using knowledge of escape mutants and relevant protein sequence datasets.

Availability: Complete source code and pre-trained models for escape prediction of severe acute respiratory syndrome coronavirus 2 protein sequences are available on Github at https://github.com/PremSinghBist/Sars-CoV-2-Escape-Model.git. The dataset is deposited to Zenodo at: doi: 10.5281/zenodo.7142638. The Python scripts are easy to run and customize as needed.

Contact: premsing212@jbnu.ac.kr.

Keywords: SARS-CoV-2; mutation; sequence analysis; viral escape prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Antibodies, Neutralizing
  • COVID-19* / genetics
  • Humans
  • Mutation
  • SARS-CoV-2* / genetics

Substances

  • Antibodies, Neutralizing