Diabetes disease prediction system using HNB classifier based on discretization method

J Integr Bioinform. 2023 Feb 23;20(1):20210037. doi: 10.1515/jib-2021-0037. eCollection 2023 Mar 1.

Abstract

Diagnosing diabetes early is critical as it helps patients live with the disease in a healthy way - through healthy eating, taking appropriate medical doses, and making patients more vigilant in their movements/activities to avoid wounds that are difficult to heal for diabetic patients. Data mining techniques are typically used to detect diabetes with high confidence to avoid misdiagnoses with other chronic diseases whose symptoms are similar to diabetes. Hidden Naïve Bayes is one of the algorithms for classification, which works under a data-mining model based on the assumption of conditional independence of the traditional Naïve Bayes. The results from this research study, which was conducted on the Pima Indian Diabetes (PID) dataset collection, show that the prediction accuracy of the HNB classifier achieved 82%. As a result, the discretization method increases the performance and accuracy of the HNB classifier.

Keywords: HNB; Pima dataset; classification; data mining; discretization.

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • Data Mining
  • Diabetes Mellitus* / diagnosis
  • Humans
  • Pima People