COVID-19 Pandemic: Identifying Key Issues Using Social Media and Natural Language Processing

J Healthc Inform Res. 2022 Feb 11;6(2):174-207. doi: 10.1007/s41666-021-00111-w. eCollection 2022 Jun.

Abstract

The COVID-19 pandemic has affected people's lives in many ways. Social media data can reveal public perceptions and experiences of the pandemic, as well as factors that hamper or support efforts to curb the global spread of the disease. In this paper, we analyzed COVID-19-related comments collected from six social media platforms using natural language processing (NLP) techniques. We identified relevant opinionated keyphrases and their sentiment polarity (negative or positive) from over 1 million randomly selected comments, and then categorized them into broader themes using thematic analysis. Our results uncover 34 negative themes, of which 17 are economic, socio-political, educational, and political issues. Twenty positive themes were also identified. We discuss the negative issues and suggest interventions to tackle them based on the positive themes and research evidence.

Keywords: COVID-19; Coronavirus; Health informatics; Keyphrase extraction; Natural language processing; Social media; Text mining; Thematic analysis.