Prediction for the Risk of Multiple Chronic Conditions Among Working Population in the United States With Machine Learning Models

IEEE Open J Eng Med Biol. 2021 Oct 6:2:291-298. doi: 10.1109/OJEMB.2021.3117872. eCollection 2021.

Abstract

Objective: Chronic diseases have become the most prevalent and costly health conditions in the healthcare industry, deteriorating the quality of life, adversely affecting the work productivity, and costing astounding medical resources. However, few studies have been conducted on the predictive analysis of multiple chronic conditions (MCC) based on the working population. Results: Seven machine learning algorithms are used to support the decision making of healthcare practitioner on the risk of MCC. The models were developed and validated using checkup data from 451,425 working population collected by the healthcare providers. Our result shows that all proposed models achieved satisfactory performance, with the AUC values ranging from 0.826 to 0.850. Among the seven predictive models, the gradient boosting tree model outperformed other models, achieving an AUC of 0.850. Conclusions: Our risk prediction model shows great promise in automating real-time diagnosis, supporting healthcare practitioners to target high-risk individuals efficiently, and helping healthcare practitioners tailor proactive strategies to prevent the onset or delay the progression of the chronic diseases.

Keywords: Multiple chronic conditions; health informatics; machine learning; predictive analysis.