Using machine learning to improve our understanding of injury risk and prediction in elite male youth football players

J Sci Med Sport. 2020 Nov;23(11):1044-1048. doi: 10.1016/j.jsams.2020.04.021. Epub 2020 May 18.

Abstract

Objectives: The purpose of this study was to examine whether the use of machine learning improved the ability of a neuromuscular screen to identify injury risk factors in elite male youth football players.

Design: Prospective cohort study.

Methods: 355 elite youth football players aged 10-18 years old completed a prospective pre-season neuromuscular screen that included anthropometric measures of size, as well as single leg countermovement jump (SLCMJ), single leg hop for distance (SLHD), 75% hop distance and stick (75%Hop), Y-balance anterior reach and tuck jump assessment. Injury incidence was monitored over one competitive season. Risk profiling was assessed using traditional regression analyses and compared to supervised machine learning algorithms constructed using decision trees.

Results: Using continuous data, multivariate logistic analysis identified SLCMJ asymmetry as the sole significant predictor of injury (OR 0.94, 0.92-0.97, p<0.001), with a specificity of 97.7% and sensitivity of 15.2% giving an AUC of 0.661. The best performing decision tree model provided a specificity of 74.2% and sensitivity of 55.6% with an AUC of 0.663. All variables contributed to the final machine model, with asymmetry in the SLCMJ, 75%Hop and Y-balance, plus tuck jump knee valgus and anthropometrics being the most frequent contributors.

Conclusions: Although both statistical methods reported similar accuracy, logistic regression provided very low sensitivity and only identified a single neuromuscular injury risk factor. The machine learning model provided much improved sensitivity to predict injury and identified interactions of asymmetry, knee valgus angle and body size as contributing factors to an injurious profile in youth football players.

Keywords: Binary logistic regression; Neuromuscular; Prospective; Screen.

MeSH terms

  • Adolescent
  • Athletic Injuries / diagnosis*
  • Child
  • England
  • Humans
  • Incidence
  • Machine Learning*
  • Male
  • Prospective Studies
  • Risk Factors
  • Sensitivity and Specificity
  • Soccer / injuries*