mikropml: User-Friendly R Package for Supervised Machine Learning Pipelines

J Open Source Softw. 2021;6(61):3073. doi: 10.21105/joss.03073. Epub 2021 May 14.

Abstract

Machine learning (ML) for classification and prediction based on a set of features is used to make decisions in healthcare, economics, criminal justice, and more. However, implementing an ML pipeline including preprocessing, model selection, and evaluation can be time-consuming, confusing, and difficult. Here, we present mikropml (pronounced "meek-ROPE em el"), an easy-to-use R package that implements ML pipelines using regression, support vector machines, decision trees, random forest, or gradient-boosted trees. The package is available on GitHub, CRAN, and conda.
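
As a rough illustration of the kind of pipeline described above, the following minimal sketch trains and evaluates one model with the package. It assumes mikropml is already installed from CRAN, and it uses the small example dataset `otu_mini_bin` and its outcome column `dx` that ship with the package; the choice of method and the seed value are arbitrary and for illustration only.

```r
# Minimal sketch; assumes mikropml is installed, e.g. install.packages("mikropml")
library(mikropml)

# otu_mini_bin is a small example dataset bundled with the package;
# "dx" is the name of its outcome column.
results <- run_ml(
  otu_mini_bin,
  "glmnet",                # regularized regression via glmnet
  outcome_colname = "dx",
  seed = 2019              # arbitrary seed so the data split is reproducible
)

# Cross-validation and held-out test performance for the trained model
results$performance
```

Swapping the method string for another supported one (for example `"rf"` for random forest) would run the same pipeline with a different model type.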