Multivariate random-parameters zero-inflated negative binomial regression model: an application to estimate crash frequencies at intersections

Accid Anal Prev. 2014 Sep:70:320-9. doi: 10.1016/j.aap.2014.04.018. Epub 2014 May 17.

Abstract

Crash data are collected through police reports and integrated with road inventory data for further analysis. Integrated police reports and inventory data yield correlated multivariate data for roadway entities (e.g., segments or intersections). Analysis of such data reveals important relationships that can help focus on high-risk situations and coming up with safety countermeasures. To understand relationships between crash frequencies and associated variables, while taking full advantage of the available data, multivariate random-parameters models are appropriate since they can simultaneously consider the correlation among the specific crash types and account for unobserved heterogeneity. However, a key issue that arises with correlated multivariate data is the number of crash-free samples increases, as crash counts have many categories. In this paper, we describe a multivariate random-parameters zero-inflated negative binomial (MRZINB) regression model for jointly modeling crash counts. The full Bayesian method is employed to estimate the model parameters. Crash frequencies at urban signalized intersections in Tennessee are analyzed. The paper investigates the performance of MZINB and MRZINB regression models in establishing the relationship between crash frequencies, pavement conditions, traffic factors, and geometric design features of roadway intersections. Compared to the MZINB model, the MRZINB model identifies additional statistically significant factors and provides better goodness of fit in developing the relationships. The empirical results show that MRZINB model possesses most of the desirable statistical properties in terms of its ability to accommodate unobserved heterogeneity and excess zero counts in correlated data. Notably, in the random-parameters MZINB model, the estimated parameters vary significantly across intersections for different crash types.

Keywords: Crash frequency; Full Bayesian method; Geometric design; MRZINB model.

Publication types

  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Accidents, Traffic / prevention & control
  • Accidents, Traffic / statistics & numerical data*
  • Bayes Theorem
  • Cities
  • Environment Design*
  • Humans
  • Models, Statistical*
  • Multivariate Analysis
  • Regression Analysis
  • Safety / statistics & numerical data*
  • Tennessee