In recent decades, the advancement of computational algorithms and the availability of big data have enabled artificial intelligence (AI) to dramatically improve predictive performance in nearly all research areas. Specifically, machine learning (ML) techniques, a major branch of AI, have been widely used in many tasks of drug discovery and development, including predicting treatment effects, identifying target genes and functional pathways, as well as selecting potential biomarkers. However, in practice, blindly applying ML methods may lead to common pitfalls, including overfitting and lack of generalizability. Therefore, how to improve the robustness and prediction accuracy of ML methods has become a crucial problem for researchers. In this review, we summarize the application of ML models to drug discovery by introducing the top-performing methods developed from large-scale drug-related data challenges in recent years.
Keywords: artificial intelligence; data challenge; drug discovery; machine learning.