Railway accident prediction strategy based on ensemble learning

Accid Anal Prev. 2022 Oct:176:106817. doi: 10.1016/j.aap.2022.106817. Epub 2022 Aug 31.

Abstract

Railway accident prediction is important for establishing early warning mechanisms and preventing accidents. Safety agencies rely on prediction models to design railroad risk management strategies. Based on historical railway accident data, an ensemble learning strategy for accident prediction is proposed. First, an improved K-nearest neighbors (KNN) data imputation algorithm is proposed to handle missing data in the dataset. Then, to reduce the impact of imbalanced data on prediction performance, an AdaBoost-Bagging method is presented. Finally, accident features are ranked by their importance in the prediction model to identify new insights into the causes of accidents. The AdaBoost-Bagging prediction method is applied to the Federal Railroad Administration (FRA) dataset. The results show that, compared with the Artificial Neural Network (ANN), XGBoost, GBDT, Stacking and AdaBoost methods, the AdaBoost-Bagging method has a smaller prediction error and a shorter inference time in predicting railway accidents. Its Accuracy, Precision, Recall and F1-score are 0.879, 0.879, 0.883 and 0.881, respectively, and its inference time is 23.38%, 12.15%, 6.66%, 3.17% and 11.41% lower than that of the five compared methods, respectively. The prediction method can mine important features of railway accidents without prior knowledge of the accident mechanism or of the relationships between railway accidents and contributing factors; for example, the critical risk factors related to derailment and collision accidents are investigated in the prediction. The findings will be helpful for the prevention and management of railway accidents.
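
As a quick consistency check on the reported metrics, the F1-score can be recomputed from the reported Precision and Recall. The sketch below assumes the standard definition of F1 as the harmonic mean of precision and recall; the abstract does not state how the metrics were averaged across accident classes, so this is an illustration and not the authors' evaluation code. The helper name `f1_score` is hypothetical.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (standard F1)."""
    if precision + recall == 0:
        # Avoid division by zero when both metrics are zero.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and Recall as reported in the abstract.
reported_precision = 0.879
reported_recall = 0.883

f1 = f1_score(reported_precision, reported_recall)
print(round(f1, 3))  # → 0.881
```

Under this assumption, the recomputed value rounds to the reported F1-score of 0.881.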

Keywords: Accident prediction; Accident prevention; AdaBoost; Bagging; Data imputation; Ensemble learning.

MeSH terms

  • Accidents, Traffic* / prevention & control
  • Algorithms
  • Humans
  • Machine Learning
  • Neural Networks, Computer
  • Railroads*