Mental health at different stages of cancer survival: a natural language processing study of Reddit posts

Front Psychol. 2023 Jun 23:14:1150227. doi: 10.3389/fpsyg.2023.1150227. eCollection 2023.

Abstract

Introduction: The purpose of this study was to use text-based social media content analysis from cancer-specific subreddits to evaluate depression and anxiety-loaded content. Natural language processing, automatic, and lexicon-based methods were employed to perform sentiment analysis and identify depression and anxiety-loaded content.

Methods: Data was collected from 187 Reddit users who had received a cancer diagnosis, were currently undergoing treatment, or had completed treatment. Participants were split according to survivorship status into short-term, transition, and long-term cancer survivors. A total of 72524 posts were analyzed across the three cancer survivor groups.

Results: The results showed that short-term cancer survivors had significantly more depression-loaded posts and more anxiety-loaded words than long-term survivors, with no significant differences relative to the transition period. The topic analysis showed that long-term survivors, more than other stages of survivorship, have resources to share their experiences with suicidal ideation and mental health issues while providing support to their survivor community.

Discussion: The results indicate that Reddit texts seem to be an indicator of when the stressor is active and mental health issues are triggered. This sets the stage for Reddit to become a platform for screening and first-hand intervention delivery. Special attention should be dedicated to short-term survivors.

Keywords: Reddit; cancer survivors; mental health; natural language processing; social media.