Detection of Fake News Text Classification on COVID-19 Using Deep Learning Approaches

Waqas Haider Bangyal; Rukhma Qasim; Najeeb Ur Rehman; Zeeshan Ahmad; Hafsa Dar; Laiqa Rukhsar; Zahra Aman; Jamil Ahmad

doi:10.1155/2021/5514220

Comput Math Methods Med. 2021 Nov 15:2021:5514220. doi: 10.1155/2021/5514220. eCollection 2021.

Authors

Waqas Haider Bangyal¹, Rukhma Qasim¹, Najeeb Ur Rehman¹, Zeeshan Ahmad¹, Hafsa Dar², Laiqa Rukhsar¹, Zahra Aman¹, Jamil Ahmad³

Affiliations

¹ Department of Computer Science, University of Gujrat, Pakistan.
² Department of Software Engineering, University of Gujrat, Pakistan.
³ Professor Computer Science, Hazara University, Mansehra, KPK, Pakistan.

Abstract

A vast amount of data is generated every second by microblogs, content sharing on social media sites, and social networking. Twitter is a popular microblogging platform where people voice their opinions about daily issues. Analyzing these opinions is the primary concern of sentiment analysis (SA), also called opinion mining. Efficiently capturing, gathering, and analyzing sentiments remains challenging for researchers. To address these challenges, we propose a highly accurate approach for SA of fake news on COVID-19. Working with a dataset of fake news on COVID-19, we began with data preprocessing (missing-value replacement, noise removal, tokenization, and stemming). We then applied a semantic model with term frequency-inverse document frequency (TF-IDF) weighting for data representation. In the measurement and evaluation step, we applied eight machine learning algorithms (naive Bayes, AdaBoost, k-nearest neighbors, random forest, logistic regression, decision tree, neural networks, and support vector machine) and four deep learning models (CNN, LSTM, RNN, and GRU). Based on the results, we built a highly efficient prediction model in Python, then trained and evaluated the classification model according to performance measures (confusion matrix, classification rate, true positive rate, ...). We then tested the model on a set of unclassified fake news on COVID-19 to predict the sentiment class of each item. The results demonstrate high accuracy compared with the other models. Finally, we provide a set of recommendations and future directions to help researchers select an efficient sentiment analysis model for Twitter data.
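
The representation and evaluation steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names (`tokenize`, `tfidf`, `accuracy_and_tpr`) are hypothetical, stemming is omitted, and the unsmoothed weighting tf × log(N / df) is an assumed variant, since the abstract does not say which TF-IDF formula was used. Only the standard library is needed.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (no stemming)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf(docs):
    """Return one {term: weight} dict per document.

    Assumed variant: weight = (count / doc length) * log(N / document frequency).
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # guard against empty documents
        vectors.append(
            {t: (c / total) * math.log(n_docs / df[t]) for t, c in counts.items()}
        )
    return vectors


def accuracy_and_tpr(y_true, y_pred, positive=1):
    """Compute the classification rate (accuracy) and the true positive rate
    from the binary confusion-matrix counts."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    accuracy = (tp + tn) / len(pairs)
    tpr = tp / (tp + fn) if (tp + fn) else 0.0
    return accuracy, tpr
```

In this variant a term that appears in every document receives weight zero, so only terms that distinguish one item from another contribute to the representation. A library vectorizer would typically apply smoothing and normalization on top of this, so its weights would differ from the ones computed here.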

MeSH terms

Algorithms*
Bayes Theorem
COVID-19*
Computational Biology
Databases, Factual
Decision Trees
Deep Learning*
Disinformation*
Humans
Logistic Models
Models, Statistical
Natural Language Processing
Neural Networks, Computer
SARS-CoV-2
Social Media
Social Networking
Support Vector Machine