Analysis of motorcycle accidents using association rule mining-based framework with parameter optimization and GIS technology

J Safety Res. 2020 Dec:75:292-309. doi: 10.1016/j.jsr.2020.09.004. Epub 2020 Sep 26.

Abstract

Introduction: Analyzing the key factors of motorcycle accidents is an effective way to reduce fatalities and improve road safety. Association Rule Mining (ARM) is an efficient data mining method for identifying critical factors associated with injury severity. However, existing studies have limitations in applying ARM: (a) most determined the parameter thresholds of ARM subjectively, which lacks objectivity and efficiency; (b) most only listed rules that met high parameter thresholds, without in-depth analysis of multiple-item rules. In addition, existing studies seldom conducted a spatial analysis of motorcycle accidents, which can provide intuitive suggestions for policymakers.

Method: To address these limitations, this study proposes an ARM-based framework to identify critical factors related to motorcycle injury severity. A parameter optimization method is proposed to determine the parameter thresholds in ARM objectively. A factor extraction method is proposed to identify individual key factors from 2-item rules and boosting factors from multiple-item rules. A geographic information system (GIS) is adopted to explore the spatial relationship between key factors and motorcycle injury severity.
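As a rough illustration of the two ARM parameters named above, the following sketch computes support and confidence for rules whose consequent is a fatal outcome, using the standard definitions (support is the fraction of records containing all items of the rule; confidence is that support divided by the support of the antecedent). The records, attribute names, and the helper `mine_rules` are made up for illustration; this is a brute-force enumeration, not the authors' framework or their optimization method.

```python
from itertools import combinations

# Toy, invented accident records (NOT data from the study).
# Each record is a set of "attribute=value" items.
records = [
    {"speed_zone=100", "light=dark", "severity=fatal"},
    {"speed_zone=100", "light=day", "severity=fatal"},
    {"speed_zone=60", "light=day", "severity=minor"},
    {"speed_zone=100", "light=dark", "severity=fatal"},
    {"speed_zone=60", "light=dark", "severity=minor"},
]


def support(itemset, records):
    """Fraction of records that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= r for r in records) / len(records)


def confidence(antecedent, consequent, records):
    """support(antecedent and consequent together) / support(antecedent)."""
    ante = support(antecedent, records)
    if ante == 0:
        return 0.0
    return support(set(antecedent) | set(consequent), records) / ante


def mine_rules(records, consequent, min_support, min_confidence, max_antecedent=2):
    """Hypothetical helper: enumerate rules antecedent -> consequent by brute force."""
    target = {consequent}
    # Candidate antecedent items: everything except severity outcomes.
    items = sorted({i for r in records for i in r if not i.startswith("severity=")})
    rules = []
    for k in range(1, max_antecedent + 1):
        for ante in combinations(items, k):
            s = support(set(ante) | target, records)
            c = confidence(ante, target, records)
            if s >= min_support and c >= min_confidence:
                rules.append((ante, s, c))
    return rules


rules = mine_rules(records, "severity=fatal", min_support=0.03, min_confidence=0.7)
for ante, s, c in rules:
    print(ante, "-> severity=fatal", f"support={s:.2f}", f"confidence={c:.2f}")
```

In this sketch a rule with one antecedent item plus the consequent corresponds to a 2-item rule, and rules with longer antecedents correspond to multiple-item rules. Real datasets would normally use a dedicated frequent-itemset algorithm rather than exhaustive enumeration.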

Results and conclusions: The framework is applied to a case study of motorcycle accidents in Victoria, Australia. Fifteen attributes are selected after data preprocessing. The best thresholds of support and confidence in ARM are determined to be 0.03 and 0.7, respectively. Five individual key factors and four boosting factors are identified as being related to fatal injury. Spatial analysis is conducted with GIS to present hot spots of motorcycle accidents. The proposed framework has been validated to perform better on parameter optimization and rule analysis in ARM.

Practical applications: The hot spots of motorcycle accidents related to fatal factors are presented in GIS maps. Policymakers can refer to those maps directly when making decisions. The framework can be applied to various kinds of traffic accidents to improve the performance of severity analysis.
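To make the reported thresholds concrete, the sketch below applies a minimum support of 0.03 and a minimum confidence of 0.7 to a small list of candidate rules, then separates the surviving rules by length. The rule list, its numbers, and the helper names are invented for illustration and are not results from the study. The threshold grid at the end only shows how the number of retained rules changes with the thresholds; it is not the parameter optimization method proposed in the paper, which the abstract does not describe.

```python
# Hypothetical (antecedent, support, confidence) triples for rules whose
# consequent is a fatal outcome. All values are invented for illustration.
candidate_rules = [
    (("speed_zone=100",), 0.050, 0.72),
    (("light=dark",), 0.040, 0.65),
    (("speed_zone=100", "light=dark"), 0.031, 0.81),
    (("road=curve",), 0.020, 0.90),
    (("helmet=no", "road=curve"), 0.010, 0.95),
]


def filter_rules(rules, min_support, min_confidence):
    """Keep rules that meet both thresholds."""
    return [r for r in rules if r[1] >= min_support and r[2] >= min_confidence]


def split_by_length(rules):
    """Separate 2-item rules (one antecedent item) from multiple-item rules."""
    two_item = [r for r in rules if len(r[0]) == 1]
    multi_item = [r for r in rules if len(r[0]) > 1]
    return two_item, multi_item


# Thresholds reported in the abstract: support 0.03, confidence 0.7.
kept = filter_rules(candidate_rules, min_support=0.03, min_confidence=0.7)
two_item, multi_item = split_by_length(kept)
print("2-item rules:", two_item)
print("multiple-item rules:", multi_item)

# How many rules survive at each point of a small, arbitrary threshold grid.
counts = {
    (s, c): len(filter_rules(candidate_rules, s, c))
    for s in (0.01, 0.03, 0.05)
    for c in (0.6, 0.7, 0.8)
}
for (s, c), n in sorted(counts.items()):
    print(f"min_support={s}, min_confidence={c}: {n} rules")
```

Lower thresholds retain more rules, including rare or weak ones, while higher thresholds can discard rules that are uncommon but strongly associated with the outcome, which is why the choice of thresholds matters.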

Keywords: Accurate and Efficient Classification Based on Multiple Class-Association Rules (CMAR); Association Rule Mining (ARM); Geographic Information System (GIS); Key Factors; Motorcycle Accidents; Threshold Determination.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Accidents, Traffic / statistics & numerical data*
  • Adult
  • Aged
  • Aged, 80 and over
  • Data Mining / standards
  • Female
  • Geographic Information Systems / statistics & numerical data
  • Humans
  • Male
  • Middle Aged
  • Motorcycles*
  • Victoria
  • Young Adult