Multiverse: Multilingual Evidence for Fake News Detection

J Imaging. 2023 Mar 27;9(4):77. doi: 10.3390/jimaging9040077.

Abstract

The rapid spread of deceptive information on the internet can have severe and irreparable consequences. As a result, it is important to develop technology that can detect fake news. Although significant progress has been made in this area, current methods are limited because they focus only on one language and do not incorporate multilingual information. In this work, we propose Multiverse-a new feature based on multilingual evidence that can be used for fake news detection and improve existing approaches. Our hypothesis that cross-lingual evidence can be used as a feature for fake news detection is supported by manual experiments based on a set of true (legit) and fake news. Furthermore, we compared our fake news classification system based on the proposed feature with several baselines on two multi-domain datasets of general-topic news and one fake COVID-19 news dataset, showing that (in combination with linguistic features) it yields significant improvements over the baseline models, bringing additional useful signals to the classifier.

Keywords: cross-lingual text similarity; fake news detection; multilinguality; news similarity.

Grants and funding

This research was supported by Social Research Computing Group, School of Computation, Information and Technology, Technical University of Munich.