Improved Prediction of Knee Osteoarthritis by the Machine Learning Model XGBoost

Indian J Orthop. 2023 Jul 29;57(10):1667-1677. doi: 10.1007/s43465-023-00936-0. eCollection 2023 Oct.

Abstract

Objectives: Accurate prediction of osteoarthritis (OA) severity can help clinicians choose an appropriate intervention. This study aims to build a model that identifies predictive risk factors and assesses the severity of knee osteoarthritis (KOA) in clinical settings.

Methods: A total of 4796 KOA cases and 1205 features were selected, using feature selection, from the public OA database, the Osteoarthritis Initiative (OAI). Six machine learning models were constructed and compared for accuracy of OA prediction. A gradient-boosting decision tree was used to identify important predictive features in the extreme gradient boosting (XGBoost) model. Model performance was evaluated by F1-score.

Results: Twenty features were identified as predictors of KOA risk and severity, spanning subject characteristics, knee symptoms/risk factors and physical exam. The XGBoost model predicted the Kellgren and Lawrence (KL) grade correctly for 54.7% of examined samples, and for the remaining 45.3% the predicted KL grades were very close to the actual grades. Among the six models tested, it showed the highest prediction accuracy, with an F1-score of 0.553.
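To make the two kinds of figures reported above concrete, the following is a minimal sketch of how a share of exactly predicted KL grades and an F1-score can be computed from paired actual and predicted grades. It is not the authors' code: the toy grade lists are invented for illustration, the function names are hypothetical, the "within one grade" measure is only one possible reading of "very close", and macro-averaging over KL grades 0 to 4 is an assumption, since the abstract does not state how the F1-score was averaged.

```python
def exact_match_rate(y_true, y_pred):
    """Fraction of samples whose predicted KL grade equals the actual grade."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def within_one_grade_rate(y_true, y_pred):
    """Fraction of samples predicted at most one KL grade away (assumed reading of 'very close')."""
    return sum(abs(t - p) <= 1 for t, p in zip(y_true, y_pred)) / len(y_true)


def macro_f1(y_true, y_pred, labels=(0, 1, 2, 3, 4)):
    """Unweighted mean of per-grade F1 scores (macro averaging is an assumption)."""
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the grade never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Invented toy data, not OAI data: actual vs. predicted KL grades for 8 knees.
y_true = [0, 1, 2, 3, 4, 2, 1, 0]
y_pred = [0, 1, 2, 2, 4, 3, 1, 1]

print("exact match rate:", exact_match_rate(y_true, y_pred))
print("within one grade:", within_one_grade_rate(y_true, y_pred))
print("macro F1:", round(macro_f1(y_true, y_pred), 3))
```

The exact-match rate and the F1-score answer different questions: the former counts correctly graded samples overall, while a macro-averaged F1 weights every KL grade equally, so a rarely predicted grade can pull the F1-score down even when most samples are graded correctly.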

Conclusions: We demonstrate that XGBoost is the best of the six examined models for predicting KOA severity. In addition, 20 risk features were identified as essential predictors of KOA, including physical exam, knee symptoms/risk factors and subject characteristics. These features may be useful for identifying high-risk KOA cases and for making appropriate treatment decisions.

Keywords: KOA prediction; Machine learning; Risk factor; Severity assessment; XGBoost.