Detecting Sentiment toward Emerging Infectious Diseases on Social Media: A Validity Evaluation of Dictionary-Based Sentiment Analysis

Int J Environ Res Public Health. 2022 Jun 1;19(11):6759. doi: 10.3390/ijerph19116759.

Abstract

Despite the popularity and efficiency of dictionary-based sentiment analysis (DSA) in public health research, little empirical evidence exists about its validity or about the factors that threaten it. This study used a random sample from a secondary Ebola tweet dataset to evaluate the validity of DSA against manual coding and to examine how textual features influence that validity. The results revealed substantial inconsistency between DSA and manual coding. The presence of certain textual features, such as negation, partially accounts for this inconsistency. The findings imply that scholars should treat results from disease-related public health research that relies on DSA with care and a critical eye, and that DSA should address such textual features more carefully.
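
To illustrate why a textual feature such as negation can make DSA disagree with a human coder, the following is a minimal sketch only. It does not reproduce the study's method or any of the dictionaries named in the keywords: the lexicon `TOY_LEXICON`, the negator list `NEGATORS`, the functions `dsa_score` and `label`, and the example tweet are all hypothetical and invented for illustration.

```python
# Hypothetical word valences for illustration; NOT taken from ANEW, LIWC, or SentiWordNet.
TOY_LEXICON = {
    "safe": 1.0, "good": 1.0, "hope": 1.0,
    "afraid": -1.0, "scary": -1.0, "deadly": -1.0,
}
# Hypothetical, deliberately tiny list of negation words.
NEGATORS = {"not", "no", "never"}


def tokenize(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (tok.strip(".,!?;:").lower() for tok in text.split())
    return [tok for tok in tokens if tok]


def dsa_score(text, handle_negation=False):
    """Sum the lexicon valences of the words in `text`.

    With handle_negation=True, a lexicon word immediately preceded by a
    negator has its valence flipped (a simple one-word window heuristic).
    """
    tokens = tokenize(text)
    score = 0.0
    for i, tok in enumerate(tokens):
        if tok not in TOY_LEXICON:
            continue
        value = TOY_LEXICON[tok]
        if handle_negation and i > 0 and tokens[i - 1] in NEGATORS:
            value = -value
        score += value
    return score


def label(score):
    """Map a numeric score to a coarse sentiment label."""
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


tweet = "I am not afraid of Ebola."  # invented example, not from the dataset
naive_label = label(dsa_score(tweet))
negation_aware_label = label(dsa_score(tweet, handle_negation=True))
print(naive_label, negation_aware_label)
```

In this toy case the plain word-count score reads "afraid" as negative, whereas a human coder would likely judge the tweet as not fearful. The one-word negation window is itself a simplification and would miss longer-range or implicit negation.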

Keywords: ANEW; LIWC; SentiWordNet; infectious diseases; sentiment analysis; validity.

MeSH terms

  • Attitude
  • Communicable Diseases, Emerging*
  • Humans
  • Public Health
  • Sentiment Analysis
  • Social Media*

Grants and funding

This research received no external funding.