Fake or not? Automated detection of COVID-19 misinformation and disinformation in social networks and digital media

Izzat Alsmadi; Natalie Manaeva Rice; Michael J O'Brien

doi:10.1007/s10588-022-09369-w

Fake or not? Automated detection of COVID-19 misinformation and disinformation in social networks and digital media

Comput Math Organ Theory. 2022 Nov 25:1-19. doi: 10.1007/s10588-022-09369-w. Online ahead of print.

Authors

Izzat Alsmadi¹, Natalie Manaeva Rice², Michael J O'Brien³

Affiliations

¹ Department of Computing and Cyber Security, Texas A&M University-San Antonio, San Antonio, USA.
² Center for Information and Communication Studies, University of Tennessee, Knoxville, USA.
³ Department of Communication, History, and Philosophy, Department of Life Sciences, Texas A&M University, San Antonio, USA.

Abstract

With the continuous spread of the COVID-19 pandemic, misinformation poses serious threats and concerns. COVID-19-related misinformation integrates a mixture of health aspects along with news and political misinformation. This mixture complicates the ability to judge whether a claim related to COVID-19 is information, misinformation, or disinformation. With no standard terminology in information and disinformation, integrating different datasets and using existing classification models can be impractical. To deal with these issues, we aggregated several COVID-19 misinformation datasets and compared differences between learning models from individual datasets versus one that was aggregated. We also evaluated the impact of using several word- and sentence-embedding models and transformers on the performance of classification models. We observed that whereas word-embedding models showed improvements in all evaluated classification models, the improvement level varied among the different classifiers. Although our work was focused on COVID-19 misinformation detection, a similar approach can be applied to myriad other topics, such as the recent Russian invasion of Ukraine.

Keywords: COVID-19; Coronavirus; Disinformation; Learning models; Misinformation.

© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.