Fitting prediction rule ensembles to psychological research data: An introduction and tutorial

Psychol Methods. 2020 Oct;25(5):636-652. doi: 10.1037/met0000256. Epub 2020 Feb 10.

Abstract

Prediction rule ensembles (PREs) are a relatively new statistical learning method, which aim to strike a balance between predictive performance and interpretability. Starting from a decision tree ensemble, like a boosted tree ensemble or a random forest, PREs retain a small subset of tree nodes in the final predictive model. These nodes can be written as simple rules of the form if [condition] then [prediction]. As a result, PREs are often much less complex than full decision tree ensembles, while they have been found to provide similar predictive performance in many situations. The current article introduces the methodology and shows how PREs can be fitted using the R package pre through several real-data examples from psychological research. The examples also illustrate a number of features of package pre that may be particularly useful for applications in psychology: support for categorical, multivariate and count responses, application of (non)negativity constraints, inclusion of confirmatory rules and standardized variable importance measures. (PsycInfo Database Record (c) 2020 APA, all rights reserved).

MeSH terms

  • Adolescent
  • Adult
  • Anxiety Disorders / diagnosis
  • Biomedical Research / methods*
  • Depressive Disorder / diagnosis
  • Disease Progression
  • Female
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged
  • Prognosis
  • Psychology / methods*
  • Statistics as Topic*
  • Young Adult