Ensemble-Learning and Feature Selection Techniques for Enhanced Antisense Oligonucleotide Efficacy Prediction in Exon Skipping

Pharmaceutics. 2023 Jun 24;15(7):1808. doi: 10.3390/pharmaceutics15071808.

Abstract

Antisense oligonucleotide (ASO)-mediated exon skipping has become a valuable tool for investigating gene function and developing gene therapy. Machine-learning-based computational methods, such as eSkip-Finder, have been developed to predict the efficacy of ASOs via exon skipping. However, these methods are computationally demanding, and the accuracy of predictions remains suboptimal. In this study, we propose a new approach to reduce the computational burden and improve the prediction performance by using feature selection within machine-learning algorithms and ensemble-learning techniques. We evaluated our approach using a dataset of experimentally validated exon-skipping events, dividing it into training and testing sets. Our results demonstrate that using a three-way-voting approach with random forest, gradient boosting, and XGBoost can significantly reduce the computation time to under ten seconds while improving prediction performance, as measured by R2 for both 2'-O-methyl nucleotides (2OMe) and phosphorodiamidate morpholino oligomers (PMOs). Additionally, the feature importance ranking derived from our approach is in good agreement with previously published results. Our findings suggest that our approach has the potential to enhance the accuracy and efficiency of predicting ASO efficacy via exon skipping. It could also facilitate the development of novel therapeutic strategies. This study could contribute to the ongoing efforts to improve ASO design and optimize gene therapy approaches.

Keywords: RNA; antisense oligonucleotides; ensemble learning; exon skipping; genetic disease; machine learning; n-of-1 therapy; personalized medicine; splice switching; splicing.

Grants and funding

This research was supported by the The Friends of Garrett Cumming Research and Muscular Dystrophy Canada Research Chair Fund, Women and Children’s Health Research Institute (WCHRI), Canadian Institutes of Health Research (CIHR), Intramural Research Grant (Grant number 2-6) for National Center of Neurology and Psychiatry (NCNP).