A data-driven Bayesian network for probabilistic crash risk assessment of individual driver with traffic violation and crash records

Accid Anal Prev. 2022 Oct:176:106790. doi: 10.1016/j.aap.2022.106790. Epub 2022 Aug 4.

Abstract

In recent years, individual drivers' crash risk assessments have received much attention for identifying high-risk drivers. To this end, we propose a probabilistic assessment method of crash risks with a reproducible long-term dataset (i.e., traffic violations, license, and crash records). In developing this method, we used 7.75 million violations and crashes of 5.5 million individual drivers in Seoul, South Korea, from June 2013 to June 2017 (four years). The stochastic process of the Bayesian network (BN), whose structure is optimized by tabu-search, successfully evaluates individual drivers' crash and violation probability. In addition, the cluster analysis classifies drivers into five distinctive groups according to their estimated violation and crash probabilities. As a result, this study found that the estimated average crash rate within a cluster converges with the actual crash rate by the proposed framework without privacy issues. We also confirm that violation records and expected crash probability are strongly correlated, and there is a direct relationship between a driver's previous violations and crash record and the future at-fault crash. The proposed assessment method is valuable in developing proactive driver education programs and safety countermeasures, including adjusting the penalty system and developing user-based insurance by recognizing dangerous drivers and identifying their properties.

Keywords: Bayesian network; Crash risk assessment; Probabilistic graphical model; Traffic crash; Traffic violation.

MeSH terms

  • Accidents, Traffic / prevention & control
  • Automobile Driving*
  • Bayes Theorem
  • Humans
  • Licensure
  • Risk Assessment / methods