Cluster-based analysis of COVID-19 cases using self-organizing map neural network and K-means methods to improve medical decision-making

Inform Med Unlocked. 2022:32:101005. doi: 10.1016/j.imu.2022.101005. Epub 2022 Jul 5.

Abstract

In this study, we utilized unsupervised machine learning techniques to examine the relationship between different symptoms in cases who died of COVID-19 and cases who recovered from it. First, our data was cleared of redundancies, and the ten most important variables were selected using a filter-based technique (extra-tree classifier). Next, we calculated the Silhouette, Davis Boldin (DB), and the mean intra-cluster distance measures to select the optimal number of clusters, then clustered the data using both the K-means and hierarchical clustering based on Self Organizing Map (SOM) neural network. Our results revealed that patients who died of COVID-19 had high mean values in different symptoms, but not all patients with this characteristic necessarily died. Besides, our result indicated that the patient's age is directly related to the hospital duration, and elderly patients are more likely to be assigned to the intensive care unit (ICU). However, the patient's sex has the same distribution in different groups and does not correlate with other symptoms. In conclusion, our results confirmed past studies. Also, this research helps physicians improve medical services by considering other important factors for treating different groups of COVID-19 patients.

Keywords: COVID-19; Clustering; Neural network; Self-organizing map; Unsupervised machine learning.