Sub-clustering based recommendation system for stroke patient: Identification of a specific drug class for a given patient

Ribot Fleury T Ceskoutsé; Alain Bertrand Bomgni; David R Gnimpieba Zanfack; Diing D M Agany; Thomas Bouetou Bouetou; Etienne Gnimpieba Zohim

doi:10.1016/j.compbiomed.2024.108117

Sub-clustering based recommendation system for stroke patient: Identification of a specific drug class for a given patient

Comput Biol Med. 2024 Mar:171:108117. doi: 10.1016/j.compbiomed.2024.108117. Epub 2024 Feb 7.

Authors

Ribot Fleury T Ceskoutsé¹, Alain Bertrand Bomgni², David R Gnimpieba Zanfack³, Diing D M Agany⁴, Thomas Bouetou Bouetou⁵, Etienne Gnimpieba Zohim⁶

Affiliations

¹ Ecole Nationale Supérieure Polytechnique, University of Yaounde I, P.O. Box. 8390, Yaoundé, Cameroon. Electronic address: fleurytene@gmail.com.
² University of South Dakota, 4800 N Career Avenue, 57107, SD, USA; Departement of Mathematics and computer science, University of Dschang, P.O. Box. 67, Dschang, Cameroon. Electronic address: alain.bomgni@usd.edu.
³ Laboratory of Innovative Technologies (LTI), University of Picardie Jule Verne (UPJV), 48 Rue Raspail, 02100 Saint Quentin, France. Electronic address: davgnimpie@gmail.com.
⁴ University of South Dakota, 4800 N Career Avenue, 57107, SD, USA. Electronic address: diing.agany@coyotes.usd.edu.
⁵ Ecole Nationale Supérieure Polytechnique, University of Yaounde I, P.O. Box. 8390, Yaoundé, Cameroon. Electronic address: tbouetou@gmail.com.
⁶ University of South Dakota, 4800 N Career Avenue, 57107, SD, USA. Electronic address: etienne.ngnimpieba@usd.edu.

PMID: 38335820
PMCID: PMC10981530 (available on 2025-03-01)
DOI: 10.1016/j.compbiomed.2024.108117

Abstract

Stroke is one of the leading causes of death worldwide. Previous studies have explored machine learning techniques for early detection of stroke patients using content-based recommendation systems. However, these models often struggle with timely detection of medications, which can be critical for patient management and decision-making regarding the prescription of new drugs. In this study, we developed a content-based recommendation model using three machine learning algorithms: Gaussian Mixture Model (GMM), Affinity Propagation (AP), and K-Nearest Neighbors (KNN), to aid Healthcare Professionals (HCP) in quickly detecting medications based on the symptoms of a patient with stroke. Our model focused on three classes of drugs: antihypertensive, anticoagulant, and fibrate. Each machine learning algorithm was used to accomplish specific tasks, thereby reducing the partial search space, computational cost, and accurately detecting a primary drug class without loss of precision and accuracy. Our proposed model, called CRGANNC (Clustering Recommendation Gaussian Affinity Nearest Neighbors Classifier), effectively addresses the sparsity and scalability issues faced by content-based recommendation models. The CRGANNC model dynamically partition clusters into sub-clusters with variable numbers based on the group, and can diagnose healthy, sick, and at-risk patients, and recommend drugs to the HCP. In addition to our analysis, we developed a semi-artificial dataset with new features such as weakness, dizziness, headache, nausea, and vomiting, using a pipeline. This dataset serves as a valuable resource for researchers in the sensitive domain of stroke, providing a starting point for building and testing models when real data is often restricted. Our work not only contributes to the development of predictive models for stroke but also establishes a framework for creating similar datasets in other sensitive domains, accelerating research efforts and improving patient care. Our experiments were conducted on our dataset consisting of 9691 patient records, with 1206 records for stroke attacks and 8485 healthy patients. The CRGANNC model achieved an average precision of 0.98, recall of 0.95 and F1-score of 0.96 across all three drugs classes. Furthermore, our model demonstrated significant improvement in computational efficiency compared to existing content-based recommendation models, reducing the processing time by 25.80% . This results indicate the effectiveness of our model in accurately detecting medications for stroke patients based on their symptoms.

Keywords: Content based filtering; Machine learning; Recommender system; Stroke disease.

MeSH terms

Algorithms*
Cluster Analysis
Dizziness*
Fibric Acids
Head
Humans

Substances

Fibric Acids

Grants and funding

P20 GM103443/GM/NIGMS NIH HHS/United States