Feasibility Study of Constructing a Screening Tool for Adolescent Diabetes Detection Applying Machine Learning Methods

Sensors (Basel). 2022 Aug 17;22(16):6155. doi: 10.3390/s22166155.

Abstract

Prediabetes and diabetes are becoming alarmingly prevalent among adolescents over the past decade. However, an effective screening tool that can assess diabetes risks smoothly is still in its infancy. In order to contribute to such significant gaps, this research proposes a machine learning-based predictive model to detect adolescent diabetes. The model applies supervised machine learning and a novel feature selection method to the National Health and Nutritional Examination Survey datasets after an exhaustive search to select reliable and accurate data. The best model achieved an area under the curve (AUC) score of 71%. This research proves that a screening tool based on supervised machine learning models can assist in the automated detection of youth diabetes. It also identifies some critical predictors to such detection using Lasso Regression, Random Forest Importance and Gradient Boosted Tree Importance feature selection methods. The most contributing features to Youth diabetes detection are physical characteristics (e.g., waist, leg length, gender), dietary information (e.g., water, protein, sodium) and demographics. These predictors can be further utilised in other areas of medical research, such as electronic medical history.

Keywords: adolescent diabetes prediction; diabetes detection; medical machine learning.

MeSH terms

  • Adolescent
  • Area Under Curve
  • Diabetes Mellitus*
  • Feasibility Studies
  • Humans
  • Machine Learning*
  • Nutrition Surveys

Grants and funding

This research was funded by Western Sydney University.