From benchmark to bedside: transfer learning from social media to patient-provider text messages for suicide risk prediction

Hannah A Burkhardt; Xiruo Ding; Amanda Kerbrat; Katherine Anne Comtois; Trevor Cohen

doi:10.1093/jamia/ocad062

From benchmark to bedside: transfer learning from social media to patient-provider text messages for suicide risk prediction

J Am Med Inform Assoc. 2023 May 19;30(6):1068-1078. doi: 10.1093/jamia/ocad062.

Authors

Hannah A Burkhardt¹, Xiruo Ding¹, Amanda Kerbrat², Katherine Anne Comtois², Trevor Cohen¹

Affiliations

¹ Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington, USA.
² Department of Psychiatry and Behavioral Sciences, University of Washington, Seattle, Washington, USA.

Abstract

Objective: Compared to natural language processing research investigating suicide risk prediction with social media (SM) data, research utilizing data from clinical settings are scarce. However, the utility of models trained on SM data in text from clinical settings remains unclear. In addition, commonly used performance metrics do not directly translate to operational value in a real-world deployment. The objectives of this study were to evaluate the utility of SM-derived training data for suicide risk prediction in a clinical setting and to develop a metric of the clinical utility of automated triage of patient messages for suicide risk.

Materials and methods: Using clinical data, we developed a Bidirectional Encoder Representations from Transformers-based suicide risk detection model to identify messages indicating potential suicide risk. We used both annotated and unlabeled suicide-related SM posts for multi-stage transfer learning, leveraging customized contemporary learning rate schedules. We also developed a novel metric estimating predictive models' potential to reduce follow-up delays with patients in distress and used it to assess model utility.

Results: Multi-stage transfer learning from SM data outperformed baseline approaches by traditional classification performance metrics, improving performance from 0.734 to a best F1 score of 0.797. Using this approach for automated triage could reduce response times by 15 minutes per urgent message.

Discussion: Despite differences in data characteristics and distribution, publicly available SM data benefit clinical suicide risk prediction when used in conjunction with contemporary transfer learning techniques. Estimates of time saved due to automated triage indicate the potential for the practical impact of such models when deployed as part of established suicide prevention interventions.

Conclusions: This work demonstrates a pathway for leveraging publicly available SM data toward improving risk assessment, paving the way for better clinical care and improved clinical outcomes.

Keywords: artificial intelligence; computer assisted; decision-making; delivery of health care; natural language processing; social media; suicide prevention.

Publication types

Research Support, U.S. Gov't, Non-P.H.S.
Research Support, Non-U.S. Gov't

MeSH terms

Benchmarking
Humans
Machine Learning
Social Media*
Suicide*
Text Messaging*