Assessment of a Prediction Model for Antidepressant Treatment Stability Using Supervised Topic Models

Michael C Hughes; Melanie F Pradier; Andrew Slavin Ross; Thomas H McCoy Jr; Roy H Perlis; Finale Doshi-Velez

doi:10.1001/jamanetworkopen.2020.5308

Assessment of a Prediction Model for Antidepressant Treatment Stability Using Supervised Topic Models

JAMA Netw Open. 2020 May 1;3(5):e205308. doi: 10.1001/jamanetworkopen.2020.5308.

Authors

Michael C Hughes¹, Melanie F Pradier², Andrew Slavin Ross², Thomas H McCoy Jr^{3

4}, Roy H Perlis^{3

4}, Finale Doshi-Velez²

Affiliations

¹ Department of Computer Science, Tufts University, Medford, Massachusetts.
² John A. Paulson School of Engineering and Applied Sciences, Cambridge, Massachusetts.
³ Center for Quantitative Health, Massachusetts General Hospital, Boston.
⁴ Harvard Medical School, Boston, Massachusetts.

Abstract

Importance: In the absence of readily assessed and clinically validated predictors of treatment response, pharmacologic management of major depressive disorder often relies on trial and error.

Objective: To assess a model using electronic health records to identify predictors of treatment response in patients with major depressive disorder.

Design, setting, and participants: This retrospective cohort study included data from 81 630 adults with a coded diagnosis of major depressive disorder from 2 academic medical centers in Boston, Massachusetts, including outpatient primary and specialty care clinics from December 1, 1997, to December 31, 2017. Data were analyzed from January 1, 2018, to March 15, 2020.

Exposures: Treatment with at least 1 of 11 standard antidepressants.

Main outcomes and measures: Stable treatment response, intended as a proxy for treatment effectiveness, defined as continued prescription of an antidepressant for 90 days. Supervised topic models were used to extract 10 interpretable covariates from coded clinical data for stability prediction. With use of data from 1 hospital system (site A), generalized linear models and ensembles of decision trees were trained to predict stability outcomes from topic features that summarize patient history. Held-out patients from site A and individuals from a second hospital system (site B) were evaluated.

Results: Among the 81 630 adults (56 340 women [69%]; mean [SD] age, 48.46 [14.75] years; range, 18.0-80.0 years), 55 303 reached a stable response to their treatment regimen during follow-up. For held-out patients from site A, the mean area under the receiver operating characteristic curve (AUC) for discrimination of the general stability outcome was 0.627 (95% CI, 0.615-0.639) for the supervised topic model with 10 covariates. In evaluation of site B, the AUC was 0.619 (95% CI, 0.610-0.627). Building models to predict stability specific to a particular drug did not improve prediction of general stability even when using a harder-to-interpret ensemble classifier and 9256 coded covariates (specific AUC, 0.647; 95% CI, 0.635-0.658; general AUC, 0.661; 95% CI, 0.648-0.672). Topics coherently captured clinical concepts associated with treatment response.

Conclusions and relevance: The findings suggest that coded clinical data available in electronic health records may facilitate prediction of general treatment response but not response to specific medications. Although greater discrimination is likely required for clinical application, the results provide a transparent baseline for such studies.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Adolescent
Adult
Aged
Aged, 80 and over
Antidepressive Agents / therapeutic use*
Depressive Disorder, Major / diagnosis
Depressive Disorder, Major / drug therapy*
Depressive Disorder, Major / psychology
Electronic Health Records
Female
Humans
Male
Middle Aged
Models, Statistical
Remission Induction
Retrospective Studies
Treatment Outcome
Young Adult

Substances

Antidepressive Agents

Abstract

Publication types

MeSH terms

Substances

Grants and funding