Identifying data-driven subtypes of major depressive disorder with electronic health records

J Affect Disord. 2024 Jul 1:356:64-70. doi: 10.1016/j.jad.2024.03.162. Epub 2024 Apr 1.

Abstract

Background: Efforts to reduce the heterogeneity of major depressive disorder (MDD) by identifying subtypes have not yet facilitated treatment personalization or investigation of biology, so novel approaches merit consideration.

Methods: We utilized electronic health records drawn from 2 academic medical centers and affiliated health systems in Massachusetts to identify data-driven subtypes of MDD, characterizing sociodemographic features, comorbid diagnoses, and treatment patterns. We applied Latent Dirichlet Allocation (LDA) to summarize diagnostic codes followed by agglomerative clustering to define patient subgroups.

Results: Among 136,371 patients (95,034 women [70 %]; 41,337 men [30 %]; mean [SD] age, 47.0 [14.0] years), the 15 putative MDD subtypes were characterized by comorbidities and distinct patterns in medication use. There was substantial variation in rates of selective serotonin reuptake inhibitor (SSRI) use (from a low of 62 % to a high of 78 %) and selective norepinephrine reuptake inhibitor (SNRI) use (from 4 % to 21 %).

Limitations: Electronic health records lack reliable symptom-level data, so we cannot examine the extent to which subtypes might differ in clinical presentation or symptom dimensions.

Conclusion: These data-driven subtypes, drawing on representative clinical cohorts, merit further investigation for their utility in identifying more homogeneous patient populations for basic as well as clinical investigation.

Keywords: Data-driven subtypes; Heterogeneity; Latent Dirichlet allocation; Major depressive disorder; Representation learning.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adult
  • Comorbidity
  • Depressive Disorder, Major* / classification
  • Depressive Disorder, Major* / diagnosis
  • Depressive Disorder, Major* / drug therapy
  • Depressive Disorder, Major* / epidemiology
  • Electronic Health Records* / statistics & numerical data
  • Female
  • Humans
  • Male
  • Massachusetts / epidemiology
  • Middle Aged
  • Selective Serotonin Reuptake Inhibitors* / therapeutic use
  • Serotonin and Noradrenaline Reuptake Inhibitors / therapeutic use

Substances

  • Selective Serotonin Reuptake Inhibitors
  • Serotonin and Noradrenaline Reuptake Inhibitors