A Novel Approach for Continuous Health Status Monitoring and Automatic Detection of Infection Incidences in People With Type 1 Diabetes Using Machine Learning Algorithms (Part 2): A Personalized Digital Infectious Disease Detection Mechanism

Ashenafi Zebene Woldaregay; Ilkka Kalervo Launonen; David Albers; Jorge Igual; Eirik Årsand; Gunnar Hartvigsen

doi:10.2196/18912

A Novel Approach for Continuous Health Status Monitoring and Automatic Detection of Infection Incidences in People With Type 1 Diabetes Using Machine Learning Algorithms (Part 2): A Personalized Digital Infectious Disease Detection Mechanism

J Med Internet Res. 2020 Aug 12;22(8):e18912. doi: 10.2196/18912.

Authors

Ashenafi Zebene Woldaregay¹, Ilkka Kalervo Launonen², David Albers^{3

4}, Jorge Igual⁵, Eirik Årsand¹, Gunnar Hartvigsen¹

Affiliations

¹ Department of Computer Science, University of Tromsø - The Arctic University of Norway, Tromsø, Norway.
² Department of Clinical Research, University Hospital of North Norway, Tromsø, Norway.
³ Department of Pediatrics, Informatics and Data Science, University of Colorado, Aurora, CO, United States.
⁴ Department of Biomedical Informatics, Columbia University, New York, NY, United States.
⁵ Universitat Politècnica de València, Valencia, Spain.

PMID: 32784179
PMCID: PMC7450372
DOI: 10.2196/18912

Abstract

Background: Semisupervised and unsupervised anomaly detection methods have been widely used in various applications to detect anomalous objects from a given data set. Specifically, these methods are popular in the medical domain because of their suitability for applications where there is a lack of a sufficient data set for the other classes. Infection incidence often brings prolonged hyperglycemia and frequent insulin injections in people with type 1 diabetes, which are significant anomalies. Despite these potentials, there have been very few studies that focused on detecting infection incidences in individuals with type 1 diabetes using a dedicated personalized health model.

Objective: This study aims to develop a personalized health model that can automatically detect the incidence of infection in people with type 1 diabetes using blood glucose levels and insulin-to-carbohydrate ratio as input variables. The model is expected to detect deviations from the norm because of infection incidences considering elevated blood glucose levels coupled with unusual changes in the insulin-to-carbohydrate ratio.

Methods: Three groups of one-class classifiers were trained on target data sets (regular days) and tested on a data set containing both the target and the nontarget (infection days). For comparison, two unsupervised models were also tested. The data set consists of high-precision self-recorded data collected from three real subjects with type 1 diabetes incorporating blood glucose, insulin, diet, and events of infection. The models were evaluated on two groups of data: raw and filtered data and compared based on their performance, computational time, and number of samples required.

Results: The one-class classifiers achieved excellent performance. In comparison, the unsupervised models suffered from performance degradation mainly because of the atypical nature of the data. Among the one-class classifiers, the boundary and domain-based method produced a better description of the data. Regarding the computational time, nearest neighbor, support vector data description, and self-organizing map took considerable training time, which typically increased as the sample size increased, and only local outlier factor and connectivity-based outlier factor took considerable testing time.

Conclusions: We demonstrated the applicability of one-class classifiers and unsupervised models for the detection of infection incidence in people with type 1 diabetes. In this patient group, detecting infection can provide an opportunity to devise tailored services and also to detect potential public health threats. The proposed approaches achieved excellent performance; in particular, the boundary and domain-based method performed better. Among the respective groups, particular models such as one-class support vector machine, K-nearest neighbor, and K-means achieved excellent performance in all the sample sizes and infection cases. Overall, we foresee that the results could encourage researchers to examine beyond the presented features into other additional features of the self-recorded data, for example, continuous glucose monitoring features and physical activity data, on a large scale.

Keywords: decision support techniques; infection detection; outbreak detection system; self-recorded health data; syndromic surveillance; type 1 diabetes.

©Ashenafi Zebene Woldaregay, Ilkka Kalervo Launonen, David Albers, Jorge Igual, Eirik Årsand, Gunnar Hartvigsen. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 12.08.2020.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Diabetes Complications / complications*
Diabetes Mellitus, Type 1 / complications*
Humans
Incidence
Machine Learning / standards*
Precision Medicine / methods*

Grants and funding

R01 LM012734/LM/NLM NIH HHS/United States