An intelligent diabetes classification and perception framework based on ensemble and deep learning method

PeerJ Comput Sci. 2024 Mar 29:10:e1914. doi: 10.7717/peerj-cs.1914. eCollection 2024.

Abstract

Sugar in the blood can harm individuals and their vital organs, potentially leading to blindness, renal illness, as well as kidney and heart diseases. Globally, diabetic patients face an average annual mortality rate of 38%. This study employs Chi-square, mutual information, and sequential feature selection (SFS) to choose features for training multiple classifiers. These classifiers include an artificial neural network (ANN), a random forest (RF), a gradient boosting (GB) algorithm, Tab-Net, and a support vector machine (SVM). The goal is to predict the onset of diabetes at an earlier age. The classifier, developed based on the selected features, aims to enable early diagnosis of diabetes. The PIMA and early-risk diabetes datasets serve as test subjects for the developed system. The feature selection technique is then applied to focus on the most important and relevant features for model training. The experiment findings conclude that the ANN exhibited a spectacular performance in terms of accuracy on the PIMA dataset, achieving a remarkable accuracy rate of 99.35%. The second experiment, conducted on the early diabetes risk dataset using selected features, revealed that RF achieved an accuracy of 99.36%. Based on our experimental results, it can be concluded that our suggested method significantly outperformed baseline machine learning algorithms already employed for diabetes prediction on both datasets.

Keywords: Artificial neural network; Diabetes; Machine learning; SFS.

Grants and funding

This work was supported by the Institute for Information and Communications Technology Promotion (IITP) (No. 2022-0-00980, Cooperative Intelligence Framework of Scene Perception for Autonomous IoT Device). This research was also supported by the Brain Pool program funded by the Ministry of Science and ICT through the National Research Foundation of Korea (2021H1D3A2A02082991). The APC was supported by the 2023 scientific promotion program funded by Jeju National University. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.