Support Vector Machine as a Supervised Learning for the Prioritization of Novel Potential SARS-CoV-2 Main Protease Inhibitors

Int J Mol Sci. 2021 Jul 19;22(14):7714. doi: 10.3390/ijms22147714.

Abstract

In the last year, the COVID-19 pandemic has highly affected the lifestyle of the world population, encouraging the scientific community towards a great effort on studying the infection molecular mechanisms. Several vaccine formulations are nowadays available and helping to reach immunity. Nevertheless, there is a growing interest towards the development of novel anti-covid drugs. In this scenario, the main protease (Mpro) represents an appealing target, being the enzyme responsible for the cleavage of polypeptides during the viral genome transcription. With the aim of sharing new insights for the design of novel Mpro inhibitors, our research group developed a machine learning approach using the support vector machine (SVM) classification. Starting from a dataset of two million commercially available compounds, the model was able to classify two hundred novel chemo-types as potentially active against the viral protease. The compounds labelled as actives by SVM were next evaluated through consensus docking studies on two PDB structures and their binding mode was compared to well-known protease inhibitors. The best five compounds selected by consensus docking were then submitted to molecular dynamics to deepen binding interactions stability. Of note, the compounds selected via SVM retrieved all the most important interactions known in the literature.

Keywords: COVID-19; classification; machine learning; main protease; molecular docking.

MeSH terms

  • Antiviral Agents / pharmacology
  • COVID-19 / virology
  • COVID-19 Drug Treatment*
  • Coronavirus Protease Inhibitors / metabolism
  • Coronavirus Protease Inhibitors / pharmacology*
  • Databases, Pharmaceutical
  • Drug Evaluation, Preclinical / methods*
  • Humans
  • Molecular Docking Simulation
  • Molecular Dynamics Simulation
  • Pandemics
  • SARS-CoV-2 / drug effects*
  • SARS-CoV-2 / enzymology
  • Small Molecule Libraries
  • Supervised Machine Learning
  • Support Vector Machine*
  • Viral Nonstructural Proteins / metabolism
  • Viral Proteases / metabolism

Substances

  • Antiviral Agents
  • Coronavirus Protease Inhibitors
  • Small Molecule Libraries
  • Viral Nonstructural Proteins
  • Viral Proteases