Using contextual data to predict risky driving events: A novel methodology from explainable artificial intelligence

Accid Anal Prev. 2023 May:184:106997. doi: 10.1016/j.aap.2023.106997. Epub 2023 Feb 26.

Abstract

Usage-based insurance has allowed insurers to dynamically tailor insurance premiums by understanding when and how safe policyholders drive. However, telematics information can also be used to understand the driving contexts experienced by the driver within each trip (e.g., road types, weather, traffic). Since different combinations of these conditions affect exposure to accidents, this understanding introduces predictive opportunities in driving risk assessment. This paper investigates the relationships between driving context combinations and risk using a naturalistic driving dataset of 77,859 km. In particular, XGBoost and Random Forests are used to determine the predictive significance of driving contexts for near-misses, speeding and distraction events. Moreover, the most important contextual factors in predicting these risky events are identified and ranked through Shapley Additive Explanations. The results show that the driving context has significant power in predicting driving risk. Speed limit, weather temperature, wind speed, traffic conditions and road slope appear in the top ten most relevant features for most risky events. Analysing contextual feature variations and their influence on risky events showed that low-speed limits increase the predicted frequency of speeding and phone unlocking events, whereas high-speed limits decrease harsh accelerations. Low temperatures decrease the expected frequency of harsh manoeuvres, and precipitations increase harsh acceleration, harsh braking, and distraction events. Furthermore, road slope, intersections and pavement quality are the most critical factors among road layout attributes. The methodology presented in this study aims to support road safety stakeholders and insurers by providing insights to study the contextual risk factors that influence road accident frequency and driving risk.

Keywords: Driving context; Explainable AI; Machine learning; Risk assessment; Usage-based insurance.

MeSH terms

  • Accidents, Traffic* / prevention & control
  • Artificial Intelligence
  • Automobile Driving*
  • Humans
  • Risk Assessment
  • Risk Factors