Detection of Fake News Text Classification on COVID-19 Using Deep Learning Approaches

Comput Math Methods Med. 2021 Nov 15:2021:5514220. doi: 10.1155/2021/5514220. eCollection 2021.

Abstract

A vast amount of data is generated every second for microblogs, content sharing via social media sites, and social networking. Twitter is an essential popular microblog where people voice their opinions about daily issues. Recently, analyzing these opinions is the primary concern of Sentiment analysis or opinion mining. Efficiently capturing, gathering, and analyzing sentiments have been challenging for researchers. To deal with these challenges, in this research work, we propose a highly accurate approach for SA of fake news on COVID-19. The fake news dataset contains fake news on COVID-19; we started by data preprocessing (replace the missing value, noise removal, tokenization, and stemming). We applied a semantic model with term frequency and inverse document frequency weighting for data representation. In the measuring and evaluation step, we applied eight machine-learning algorithms such as Naive Bayesian, Adaboost, K-nearest neighbors, random forest, logistic regression, decision tree, neural networks, and support vector machine and four deep learning CNN, LSTM, RNN, and GRU. Afterward, based on the results, we boiled a highly efficient prediction model with python, and we trained and evaluated the classification model according to the performance measures (confusion matrix, classification rate, true positives rate...), then tested the model on a set of unclassified fake news on COVID-19, to predict the sentiment class of each fake news on COVID-19. Obtained results demonstrate a high accuracy compared to the other models. Finally, a set of recommendations is provided with future directions for this research to help researchers select an efficient sentiment analysis model on Twitter data.

MeSH terms

  • Algorithms*
  • Bayes Theorem
  • COVID-19*
  • Computational Biology
  • Databases, Factual
  • Decision Trees
  • Deep Learning*
  • Disinformation*
  • Humans
  • Logistic Models
  • Models, Statistical
  • Natural Language Processing
  • Neural Networks, Computer
  • SARS-CoV-2
  • Social Media
  • Social Networking
  • Support Vector Machine