Assessment of a Prediction Model for Antidepressant Treatment Stability Using Supervised Topic Models

JAMA Netw Open. 2020 May 1;3(5):e205308. doi: 10.1001/jamanetworkopen.2020.5308.

Abstract

Importance: In the absence of readily assessed and clinically validated predictors of treatment response, pharmacologic management of major depressive disorder often relies on trial and error.

Objective: To assess a model using electronic health records to identify predictors of treatment response in patients with major depressive disorder.

Design, setting, and participants: This retrospective cohort study included data from 81 630 adults with a coded diagnosis of major depressive disorder from 2 academic medical centers in Boston, Massachusetts, including outpatient primary and specialty care clinics from December 1, 1997, to December 31, 2017. Data were analyzed from January 1, 2018, to March 15, 2020.

Exposures: Treatment with at least 1 of 11 standard antidepressants.

Main outcomes and measures: Stable treatment response, intended as a proxy for treatment effectiveness, defined as continued prescription of an antidepressant for 90 days. Supervised topic models were used to extract 10 interpretable covariates from coded clinical data for stability prediction. With use of data from 1 hospital system (site A), generalized linear models and ensembles of decision trees were trained to predict stability outcomes from topic features that summarize patient history. Held-out patients from site A and individuals from a second hospital system (site B) were evaluated.

Results: Among the 81 630 adults (56 340 women [69%]; mean [SD] age, 48.46 [14.75] years; range, 18.0-80.0 years), 55 303 reached a stable response to their treatment regimen during follow-up. For held-out patients from site A, the mean area under the receiver operating characteristic curve (AUC) for discrimination of the general stability outcome was 0.627 (95% CI, 0.615-0.639) for the supervised topic model with 10 covariates. In evaluation of site B, the AUC was 0.619 (95% CI, 0.610-0.627). Building models to predict stability specific to a particular drug did not improve prediction of general stability even when using a harder-to-interpret ensemble classifier and 9256 coded covariates (specific AUC, 0.647; 95% CI, 0.635-0.658; general AUC, 0.661; 95% CI, 0.648-0.672). Topics coherently captured clinical concepts associated with treatment response.

Conclusions and relevance: The findings suggest that coded clinical data available in electronic health records may facilitate prediction of general treatment response but not response to specific medications. Although greater discrimination is likely required for clinical application, the results provide a transparent baseline for such studies.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Antidepressive Agents / therapeutic use*
  • Depressive Disorder, Major / diagnosis
  • Depressive Disorder, Major / drug therapy*
  • Depressive Disorder, Major / psychology
  • Electronic Health Records
  • Female
  • Humans
  • Male
  • Middle Aged
  • Models, Statistical
  • Remission Induction
  • Retrospective Studies
  • Treatment Outcome
  • Young Adult

Substances

  • Antidepressive Agents