Spontaneously generated online patient experience data - how and why is it being used in health research: an umbrella scoping review

BMC Med Res Methodol. 2022 May 14;22(1):139. doi: 10.1186/s12874-022-01610-z.

Abstract

Purpose: Social media has led to fundamental changes in the way that people look for and share health related information. There is increasing interest in using this spontaneously generated patient experience data as a data source for health research. The aim was to summarise the state of the art regarding how and why SGOPE data has been used in health research. We determined the sites and platforms used as data sources, the purposes of the studies, the tools and methods being used, and any identified research gaps.

Methods: A scoping umbrella review was conducted looking at review papers from 2015 to Jan 2021 that studied the use of SGOPE data for health research. Using keyword searches we identified 1759 papers from which we included 58 relevant studies in our review.

Results: Data was used from many individual general or health specific platforms, although Twitter was the most widely used data source. The most frequent purposes were surveillance based, tracking infectious disease, adverse event identification and mental health triaging. Despite the developments in machine learning the reviews included lots of small qualitative studies. Most NLP used supervised methods for sentiment analysis and classification. Very early days, methods need development. Methods not being explained. Disciplinary differences - accuracy tweaks vs application. There is little evidence of any work that either compares the results in both methods on the same data set or brings the ideas together.

Conclusion: Tools, methods, and techniques are still at an early stage of development, but strong consensus exists that this data source will become very important to patient centred health research.

Keywords: Health research; Machine learning; Methods; Natural language processing; Social media; Text analysis; Umbrella review.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Collection
  • Humans
  • Information Storage and Retrieval
  • Machine Learning
  • Patient Outcome Assessment
  • Social Media*