Machine learning-derived model for predicting poor post-treatment quality of life in Korean cancer survivors

Yu Hyeon Choe; Sujee Lee; Yooseok Lim; Soo Hyun Kim

doi:10.1007/s00520-024-08347-z

Machine learning-derived model for predicting poor post-treatment quality of life in Korean cancer survivors

Support Care Cancer. 2024 Feb 5;32(3):143. doi: 10.1007/s00520-024-08347-z.

Authors

Yu Hyeon Choe^#¹, Sujee Lee^#², Yooseok Lim², Soo Hyun Kim³

Affiliations

¹ Department of Nursing, Inha University, Incheon, Republic of Korea.
² Department of Industrial and Information Systems Engineering, Soongsil University, Seoul, Republic of Korea.
³ Department of Nursing, Inha University, Incheon, Republic of Korea. soohyun@inha.ac.kr.

^# Contributed equally.

PMID: 38315224
DOI: 10.1007/s00520-024-08347-z

Abstract

Purpose: A substantial number of cancer survivors have poor quality of life (QOL) even after completing cancer treatment. Thus, in this study, we used machine learning (ML) to develop predictive models for poor QOL in post-treatment cancer survivors in South Korea.

Methods: This cross-sectional study used online survey data from 1,005 post-treatment cancer survivors in South Korea. The outcome variable was QOL, which was measured using the global QOL subscale of the European Organization of Cancer and Treatment for Cancer Quality of Life Questionnaire, where a global QOL score < 60.4 was defined as poor QOL. Three ML models (random forest (RF), support vector machine, and extreme gradient boosting) and three deep learning models were used to develop predictive models for poor QOL. Model performance regarding accuracy, area under the receiver operating characteristic curve, F1 score, precision, and recall was evaluated. The SHapely Additive exPlanation (SHAP) method was used to identify important features.

Results: Of the 1,005 participants, 65.1% had poor QOL. Among the six models, the RF model had the best performance (accuracy = 0.85, F1 = 0.90). The SHAP method revealed that survivorship concerns (e.g., distress, pain, and fatigue) were the most important factors that affected poor QOL.

Conclusions: The ML-based prediction model developed to predict poor QOL in Korean post-treatment cancer survivors showed good accuracy. The ML model proposed in this study can be used to support clinical decision-making in identifying survivors at risk of poor QOL.

Keywords: Cancer survivors; Machine learning; Prediction model; Quality of life.

MeSH terms

Cancer Survivors*
Cross-Sectional Studies
Humans
Machine Learning
Neoplasms* / therapy
Quality of Life
Republic of Korea

Grants and funding

2022R1A2C1092797/National Research Foundation of Korea grant funded by the Korean government Ministry of Science and ICT