Prediction of antischistosomal small molecules using machine learning in the era of big data

Mol Divers. 2022 Jun;26(3):1597-1607. doi: 10.1007/s11030-021-10288-2. Epub 2021 Aug 5.

Abstract

Schistosomiasis is a neglected tropical disease caused by helminths of the Schistosoma genus. Despite its high morbidity and socio-economic burden, therapeutics are just a handful with praziquantel being the main drug. Praziquantel is an old drug registered for human use in 1982 and has since been administered en masse for chemotherapy, risking the development of resistance, thus the need for new drugs with different mechanisms of action. This review examines the use of machine learning (ML) in this era of big data to aid in the prediction of novel antischistosomal molecules. It first discusses the challenges of drug discovery in schistosomiasis. Explanations are then offered for big data, its characteristics and then, some open databases where large biochemical data on schistosomiasis can be obtained for ML model development are examined. The concepts of artificial intelligence, ML, and deep learning and their drug applications are explored in schistosomiasis. The use of binary classification in predicting antischistosomal compounds and some algorithms that have been applied including random forest and naive Bayesian are discussed. For this review, some deep learning algorithms (deep neural networks) are proposed as novel algorithms for predicting antischistosomal molecules via binary classification. Databases specifically designed for housing bioactivity data on antischistosomal molecules enriched with functional genomic datasets and ontologies are thus urgently needed for developing predictive ML models. This shows the application of machine learning techniques for the discovery of novel antischistosomal small molecules via binary classification in the era of big data.

Keywords: Artificial intelligence; Big data; Binary classification; Classifiers; Deep learning; Drug discovery; Machine learning; Schistosomiasis.

Publication types

  • Review

MeSH terms

  • Artificial Intelligence
  • Bayes Theorem
  • Big Data
  • Humans
  • Machine Learning
  • Praziquantel* / therapeutic use
  • Schistosomiasis* / drug therapy

Substances

  • Praziquantel