Integration of model-based recursive partitioning with bias reduction estimation: a case study assessing the impact of Oliver's four factors on the probability of winning a basketball game

Adv Stat Anal. 2023;107(1-2):271-293. doi: 10.1007/s10182-022-00456-6. Epub 2022 Jul 4.

Abstract

In this contribution, we investigate the importance of Oliver's Four Factors, proposed in the literature to identify a basketball team's strengths and weaknesses in terms of shooting, turnovers, rebounding and free throws, as success drivers of a basketball game. In order to investigate the role of each factor in the success of a team in a match, we applied the MOdel-Based recursive partitioning (MOB) algorithm to real data concerning 19,138 matches of 16 National Basketball Association (NBA) regular seasons (from 2004-2005 to 2019-2020). MOB, instead of fitting one global Generalized Linear Model (GLM) to all observations, partitions the observations according to selected partitioning variables and estimates several ad hoc local GLMs for subgroups of observations. The manuscript's aim is twofold: (1) in order to deal with (quasi) separation problems leading to convergence problems in the numerical solution of Maximum Likelihood (ML) estimation in MOB, we propose a methodological extension of GLM-based recursive partitioning from standard ML estimation to bias-reduced (BR) estimation; and (2) we apply the BR-based GLM trees to basketball analytics. The results show models very easy to interpret that can provide useful support to coaching staff's decisions.

Supplementary information: The online version contains supplementary material available at 10.1007/s10182-022-00456-6.

Keywords: Basketball; Bias reduction GLM; Data mining; MOB; Sport analytics; Success drivers.