Detecting diseases in medical prescriptions using data mining methods

Sana Nazari Nezhad; Mohammad H Zahedi; Elham Farahani

doi:10.1186/s13040-022-00314-w

BioData Min. 2022 Nov 24;15(1):29. doi: 10.1186/s13040-022-00314-w.

Authors

Sana Nazari Nezhad¹, Mohammad H Zahedi², Elham Farahani³

Affiliations

¹ Department of Industrial Engineering, K. N. Toosi University of Technology, Tehran, Iran. sana.nazari@email.kntu.ac.ir.
² Department of Industrial Engineering, K. N. Toosi University of Technology, Tehran, Iran.
³ Sharif University of Technology, Tehran, Iran.

Abstract

Every year, the health of millions of people around the world is compromised by misdiagnosis, which can sometimes even lead to death. Misdiagnosis also entails huge financial costs for patients, insurance companies, and governments. Furthermore, many physicians' professional lives are adversely affected by unintended errors in prescribing medication or diagnosing a disease. Our aim in this paper is to use data mining methods to extract knowledge from a dataset of medical prescriptions that can help improve the diagnostic process. In this study, the disease and its category were predicted using four single classification algorithms: decision tree, random forest, naive Bayes, and K-nearest neighbors. Then, to improve the performance of these algorithms, we used an ensemble learning methodology to build our proposed model. In the final step, a number of experiments were performed to compare the performance of the different data mining techniques. The final model proposed in this study achieves an accuracy of 62.86% and a kappa score of 0.620 for disease prediction, and an accuracy of 74.39% and a kappa score of 0.720 for prediction of the disease category, which is better than the performance reported by other studies in this field. In general, the results of this study can be used to help maintain the health of patients and to prevent the waste of the financial resources of patients, insurance companies, and governments. In addition, they can aid physicians and help their careers by providing timely information on diagnostic errors. Finally, these results can serve as a basis for future research in this field.

Keywords: Data mining; Prediction; Prescription.
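
The abstract reports accuracy and kappa scores for an ensemble built on four single classifiers, but does not say how the ensemble combines them. As a rough illustration only, the sketch below assumes hard (majority) voting, which may differ from the authors' method, and computes accuracy and Cohen's kappa on a small made-up set of labels. The disease names, the per-model predictions, and the function names are all hypothetical and are not taken from the paper's dataset.

```python
from collections import Counter


def majority_vote(predictions_per_model):
    """Combine per-model label lists by hard (majority) voting.

    Assumption: the paper does not specify its ensemble rule; majority
    voting is used here purely for illustration.
    """
    combined = []
    for votes in zip(*predictions_per_model):
        # On a tie, Counter keeps the label that was seen first.
        combined.append(Counter(votes).most_common(1)[0][0])
    return combined


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: agreement corrected for agreement expected by chance."""
    n = len(y_true)
    p_observed = accuracy(y_true, y_pred)
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    # Chance agreement: sum over classes of P(true = c) * P(pred = c).
    p_expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if p_expected == 1.0:
        # Degenerate single-class case; returning 1.0 is a choice made for this sketch.
        return 1.0
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical toy labels and per-model predictions (not from the paper).
y_true = ["flu", "flu", "asthma", "asthma", "diabetes", "diabetes"]
tree   = ["flu", "asthma", "asthma", "asthma", "diabetes", "flu"]
forest = ["flu", "flu", "asthma", "diabetes", "diabetes", "flu"]
bayes  = ["flu", "flu", "flu", "asthma", "diabetes", "diabetes"]
knn    = ["asthma", "flu", "asthma", "asthma", "flu", "flu"]

ensemble = majority_vote([tree, forest, bayes, knn])
ensemble_accuracy = accuracy(y_true, ensemble)
ensemble_kappa = cohen_kappa(y_true, ensemble)

print("ensemble predictions:", ensemble)
print("accuracy:", round(ensemble_accuracy, 3))
print("kappa:", round(ensemble_kappa, 3))
```

On this toy data the ensemble gets five of six labels right, so kappa comes out lower than accuracy because part of that agreement would be expected by chance. This is why the two scores are reported side by side when there are many disease classes.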