Defining machine learning algorithms as accident prediction models for Italian two-lane rural, suburban, and urban roads

Int J Inj Contr Saf Promot. 2022 Dec;29(4):450-462. doi: 10.1080/17457300.2022.2075397. Epub 2022 May 25.

Abstract

Four Accident Prediction Models have been defined for Italian two-lane rural, suburban, and urban roads by exploiting different Machine Learning Algorithms. Specifically, a Classification and Regression Tree, a Boosted Regression Tree, a Random Forest, and a Support Vector Machine have been implemented to predict the number of Fatal and Injury crashes on a 905-km network, which experienced 5,802 FI crashes in 2008-2016. The dataset incorporates geometrical, functional, and environmental information. Several performance metrics have been computed, such as Determination Coefficient, Mean Absolute Error, Root Mean Square Error, and scatterplots. Outcomes suggest that Support Vector Machine outperforms the other Machine Learning Algorithms for predicting Fatal and Injury crashes. In Addition, the computation of Predictor Importance shows that traffic flow, the density of intersections, driveway density, and type of area are the most impacting factors on crash likelihood. Road authorities may use these findings for conducting reliable safety analyses.

Keywords: Accident prediction models; Fatal and injury crashes; crash prediction models; machine learning algorithms; road safety; two-lane roads.

MeSH terms

  • Accidents, Traffic*
  • Algorithms
  • Humans
  • Italy
  • Machine Learning
  • Models, Statistical*