Trajectories of clinical characteristics, complications and treatment choices in data-driven subgroups of type 2 diabetes

Xinyu Li; Louise A Donnelly; Roderick C Slieker; Joline W J Beulens; Leen M 't Hart; Petra J M Elders; Ewan R Pearson; Anoukh van Giessen; Jose Leal; Talitha Feenstra

doi:10.1007/s00125-024-06147-y

Diabetologia. 2024 Apr 16. doi: 10.1007/s00125-024-06147-y. Online ahead of print.

Authors

Xinyu Li¹, Louise A Donnelly², Roderick C Slieker³ ⁴ ⁵, Joline W J Beulens⁴ ⁵ ⁶, Leen M 't Hart³ ⁴ ⁵ ⁷, Petra J M Elders⁵ ⁸, Ewan R Pearson², Anoukh van Giessen⁹, Jose Leal¹⁰, Talitha Feenstra¹¹ ⁹

Affiliations

¹ Groningen Research Institute of Pharmacy, Faculty of Science and Engineering, University of Groningen, Groningen, the Netherlands. li.xinyu@rug.nl.
² Division of Population Health and Genomics, Ninewells Hospital and School of Medicine, University of Dundee, Dundee, UK.
³ Department of Cell and Chemical Biology, Leiden University Medical Center, Leiden, the Netherlands.
⁴ Department of Epidemiology and Data Science, Amsterdam UMC, Location Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.
⁵ Amsterdam Public Health, Amsterdam Cardiovascular Sciences, Amsterdam, the Netherlands.
⁶ Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands.
⁷ Department of Biomedical Data Sciences, Section Molecular Epidemiology, Leiden University Medical Center, Leiden, the Netherlands.
⁸ Department of General Practice, Amsterdam University Medical Center, Location Vrije Universiteit Amsterdam, Amsterdam, the Netherlands.
⁹ National Institute of Public Health and the Environment, Bilthoven, the Netherlands.
¹⁰ Health Economics Research Centre, Nuffield Department of Population Health, University of Oxford, Oxford, UK.
¹¹ Groningen Research Institute of Pharmacy, Faculty of Science and Engineering, University of Groningen, Groningen, the Netherlands.

PMID: 38625583
DOI: 10.1007/s00125-024-06147-y

Abstract

Aims/hypothesis: This study aimed to explore the added value of subgroups that categorise individuals with type 2 diabetes by k-means clustering for two primary care registries (the Netherlands and Scotland), inspired by Ahlqvist's novel diabetes subgroups and previously analysed by Slieker et al.

Methods: We used two Dutch and Scottish diabetes cohorts (N=3054 and 6145; median follow-up=11.2 and 12.3 years, respectively) and defined five subgroups by k-means clustering with age at baseline, BMI, HbA1c, HDL-cholesterol and C-peptide. We investigated differences between subgroups by trajectories of risk factor values (random intercept models), time to diabetes-related complications (logrank tests and Cox models) and medication patterns (multinomial logistic models). We also compared using the clustering indicators directly as predictors of progression with using the discrete k-means subgroups. Cluster consistency over follow-up was assessed.
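The clustering step described in the methods can be illustrated with a minimal sketch. It is not the authors' code: the data are synthetic, the standardisation step, the random initialisation and the helper names (`standardise`, `kmeans`) are assumptions made for illustration, and only the five indicators and the choice of five clusters come from the abstract.

```python
import random
from statistics import mean, pstdev

def standardise(rows):
    """Scale each column to mean 0 and SD 1 (an assumed preprocessing step)."""
    cols = list(zip(*rows))
    mus = [mean(c) for c in cols]
    sds = [pstdev(c) or 1.0 for c in cols]
    return [[(v - m) / s for v, m, s in zip(row, mus, sds)] for row in rows]

def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def kmeans(points, k, n_iter=100, seed=0):
    """Plain Lloyd's algorithm with random initial centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(n_iter):
        # Assign every point to its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points
        ]
        # Move each centroid to the mean of its members (keep it if empty).
        for j in range(k):
            members = [p for p, lab in zip(points, new_labels) if lab == j]
            if members:
                centroids[j] = [mean(c) for c in zip(*members)]
        if new_labels == labels:  # assignments stopped changing
            break
        labels = new_labels
    return labels, centroids

# Synthetic individuals, NOT study data. Columns: age at baseline (years),
# BMI (kg/m2), HbA1c (mmol/mol), HDL-cholesterol (mmol/l), C-peptide (nmol/l).
rng = random.Random(42)
raw = [
    [rng.gauss(62, 10), rng.gauss(30, 5), rng.gauss(55, 12),
     rng.gauss(1.2, 0.3), rng.gauss(1.0, 0.4)]
    for _ in range(300)
]

scaled = standardise(raw)
labels, centroids = kmeans(scaled, k=5)
sizes = [labels.count(j) for j in range(5)]
print("cluster sizes:", sizes)
```

Standardising first keeps any single indicator from dominating the distance purely because of its units. In practice an established library implementation with multiple restarts would be preferable to this hand-rolled loop.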

Results: Subgroups' risk factors were significantly different, and these differences remained generally consistent over follow-up. Among all subgroups, individuals with severe insulin resistance faced a significantly higher risk of myocardial infarction both before (HR 1.65; 95% CI 1.40, 1.94) and after adjusting for age effect (HR 1.72; 95% CI 1.46, 2.02) compared with mild diabetes with high HDL-cholesterol. Individuals with severe insulin-deficient diabetes were most intensively treated, with more than 25% prescribed insulin at 10 years of diagnosis. For severe insulin-deficient diabetes relative to mild diabetes, the relative risks for using insulin relative to no common treatment would be expected to increase by a factor of 3.07 (95% CI 2.73, 3.44), holding other factors constant. Clustering indicators were better predictors of progression variation relative to subgroups, but prediction accuracy may improve after combining both. Clusters were consistent over 8 years with an accuracy ranging from 59% to 72%.
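As a reading aid for the ratios above, the following sketch checks that each reported point estimate sits near the geometric centre of its 95% CI. It assumes the intervals are Wald-type, that is, symmetric on the log scale with z = 1.96, which the abstract does not state. The function name `log_scale_summary` and the dictionary keys are hypothetical, and the implied standard errors are back-calculated approximations, not reported results.

```python
import math

def log_scale_summary(lower, upper, z=1.96):
    """Return the geometric centre of a CI and the implied SE of the log ratio.

    Assumes a Wald-type interval: exp(log(ratio) +/- z * SE).
    """
    log_lo, log_hi = math.log(lower), math.log(upper)
    centre = math.exp((log_lo + log_hi) / 2)
    se = (log_hi - log_lo) / (2 * z)
    return centre, se

# (point estimate, lower bound, upper bound) as reported in the abstract.
reported = {
    "MI, unadjusted HR": (1.65, 1.40, 1.94),
    "MI, age-adjusted HR": (1.72, 1.46, 2.02),
    "insulin vs no common treatment": (3.07, 2.73, 3.44),
}

summary = {}
for name, (estimate, lower, upper) in reported.items():
    centre, se = log_scale_summary(lower, upper)
    summary[name] = (estimate, centre, se)
    print(f"{name}: reported {estimate}, CI centre {centre:.3f}, implied SE {se:.3f}")
```

Small gaps between the reported estimate and the recomputed centre are expected, because the published bounds are rounded to two decimals.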

Conclusions/interpretation: Data-driven subgroup allocations were generally consistent over follow-up and captured significant differences in risk factor trajectories, medication patterns and complication risks. Subgroups serve better as a complement rather than as a basis for compressing clustering indicators.

Keywords: Data-driven subgroups; Longitudinal analysis; Real-world data; Routine care; Stratification of diabetes.

Grants and funding

115881/Horizon 2020 Framework Programme