Use and performance of machine learning models for type 2 diabetes prediction in community settings: A systematic review and meta-analysis

Int J Med Inform. 2020 Nov:143:104268. doi: 10.1016/j.ijmedinf.2020.104268. Epub 2020 Sep 7.

Abstract

Objective: We aimed to identify machine learning (ML) models for type 2 diabetes (T2DM) prediction in community settings and determine their predictive performance.

Method: Systematic review of ML predictive modelling studies in 13 databases since 2009 was conducted. Primary outcomes included metrics of discrimination, calibration, and classification. Secondary outcomes included important variables, level of validation, and intended use of models. Meta-analysis of c-indices, subgroup analyses, meta-regression, publication bias assessments and sensitivity analyses were conducted.

Results: Twenty-three studies (40 prediction models) were included. Studies with high-, moderate-, and low- risk of bias were 3, 14, and 6 respectively. All studies conducted internal validation whereas none conducted external validation of their models. Twenty studies provided classification metrics to varying extents whereas only 7 studies performed model calibration. Eighteen studies reported information on both the variables used for model development and the feature importance. Twelve studies highlighted potential applicability of their models for T2DM screening. Meta-analysis produced a good pooled c-index (0.812). Sources of heterogeneity were identified through subgroup analyses and meta-regression. Issues pertaining to methodological quality and reporting were observed.

Conclusions: We found evidence of good performance of ML models for T2DM prediction in the community. Improvements to methodology, reporting and validation are needed before they can be used at scale.

Keywords: Diabetes mellitus; Diagnosis; Machine learning; Meta-Analysis; Prognosis; Type 2.

Publication types

  • Meta-Analysis
  • Research Support, Non-U.S. Gov't
  • Review
  • Systematic Review

MeSH terms

  • Diabetes Mellitus, Type 2* / diagnosis
  • Humans
  • Machine Learning
  • Mass Screening