MultiWD: Multi-label wellness dimensions in social media posts

Muskan Garg; Xingyi Liu; M S V P J Sathvik; Shaina Raza; Sunghwan Sohn

doi:10.1016/j.jbi.2024.104586

MultiWD: Multi-label wellness dimensions in social media posts

J Biomed Inform. 2024 Feb:150:104586. doi: 10.1016/j.jbi.2024.104586. Epub 2024 Jan 6.

Authors

Muskan Garg¹, Xingyi Liu², M S V P J Sathvik³, Shaina Raza⁴, Sunghwan Sohn⁵

Affiliations

¹ Mayo Clinic, Rochester, 55901 MN, USA. Electronic address: garg.muskan@mayo.edu.
² Mayo Clinic, Rochester, 55901 MN, USA. Electronic address: liu.xingyi@mayo.edu.
³ IIIT Dharwad, Goa, 580011 IN, India. Electronic address: 20bec024@iiitdwd.ac.in.
⁴ Vector Institute for Artificial Intelligence, Toronto, M5G 1M1 ON, Canada. Electronic address: shaina.raza@vectorinstitute.ai.
⁵ Mayo Clinic, Rochester, 55901 MN, USA. Electronic address: sohn.sunghwan@mayo.edu.

PMID: 38191011
PMCID: PMC10923126 (available on 2025-02-01)
DOI: 10.1016/j.jbi.2024.104586

Abstract

Background: Halbert L. Dunn's concept of wellness is a multi-dimensional aspect encompassing social and mental well-being. Neglecting these dimensions over time can have a negative impact on an individual's mental health. The manual efforts employed in in-person therapy sessions reveal that underlying factors of mental disturbance if triggered, may lead to severe mental health disorders.

Objective: In our research, we introduce a fine-grained approach focused on identifying indicators of wellness dimensions and mark their presence in self-narrated human-writings on Reddit social media platform.

Design and method: We present the MultiWD dataset, a curated collection comprising 3281 instances, as a specifically designed and annotated dataset that facilitates the identification of multiple wellness dimensions in Reddit posts. In our study, we introduce the task of identifying wellness dimensions and utilize state-of-the-art classifiers to solve this multi-label classification task.

Results: Our findings highlights the best and comparative performance of fine-tuned large language models with fine-tuned BERT model. As such, we set BERT as a baseline model to tag wellness dimensions in a user-penned text with F1 score of 76.69.

Conclusion: Our findings underscore the need of trustworthy and domain-specific knowledge infusion to develop more comprehensive and contextually-aware AI models for tagging and extracting wellness dimensions.

Keywords: Dataset; Mental health; Multi-label classification; Wellness dimensions.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Awareness
Humans
Mental Disorders*
Mental Health
Social Media*

Grants and funding

R01 AG068007/AG/NIA NIH HHS/United States