Age-Stratified Analysis of COVID-19 Outcome Using Machine Learning Predictive Models

Healthcare (Basel). 2022 Oct 14;10(10):2027. doi: 10.3390/healthcare10102027.

Abstract

Since the emergence of COVID-19, most health systems around the world have experienced a series of spikes in the number of infected patients, leading to collapse of the health systems in many countries. The use of clinical laboratory tests can serve as a discriminatory method for disease severity, defining the profile of patients with a higher risk of mortality. In this paper, we study the results of applying predictive models to data regarding COVID-19 outcome, using three datasets after age stratification of patients. The extreme gradient boosting (XGBoost) algorithm was employed as the predictive method, yielding excellent results. The area under the receiving operator characteristic curve (AUROC) value was 0.97 for the subgroup of patients up to 65 years of age. In addition, SHAP (Shapley additive explanations) was used to analyze the feature importance in the resulting models.

Keywords: COVID-19; feature importance; machine learning; prediction.

Grants and funding

This research received no external funding.