Logistic Regression Model for a Bivariate Binomial Distribution with Applications in Baseball Data Analysis

Entropy (Basel). 2022 Aug 17;24(8):1138. doi: 10.3390/e24081138.

Abstract

There has been a considerable amount of literature on binomial regression models that utilize well-known link functions, such as logistic, probit, and complementary log-log functions. The conventional binomial model is focused only on a single parameter representing one probability of success. However, we often encounter data for which two different success probabilities are of interest simultaneously. For instance, there are several offensive measures in baseball to predict the future performance of batters. Under these circumstances, it would be meaningful to consider more than one success probability. In this article, we employ a bivariate binomial distribution that possesses two success probabilities to conduct a regression analysis with random effects being incorporated under a Bayesian framework. Major League Baseball data are analyzed to demonstrate our methodologies. Extensive simulation studies are conducted to investigate model performances.

Keywords: Metropolis–Hastings algorithm; bivariate binomial distribution; gibbs sampling; logistic regression; posterior mean; random effect.