A novel few-shot learning based multi-modality fusion model for COVID-19 rumor detection from online social media

PeerJ Comput Sci. 2021 Aug 20:7:e688. doi: 10.7717/peerj-cs.688. eCollection 2021.

Abstract

Background: Rumor detection is a popular research topic in natural language processing and data mining. Since the outbreak of COVID-19, related rumors have been widely posted and spread on online social media, which have seriously affected people's daily lives, national economy, social stability, etc. It is both theoretically and practically essential to detect and refute COVID-19 rumors fast and effectively. As COVID-19 was an emergent event that was outbreaking drastically, the related rumor instances were very scarce and distinct at its early stage. This makes the detection task a typical few-shot learning problem. However, traditional rumor detection techniques focused on detecting existed events with enough training instances, so that they fail to detect emergent events such as COVID-19. Therefore, developing a new few-shot rumor detection framework has become critical and emergent to prevent outbreaking rumors at early stages.

Methods: This article focuses on few-shot rumor detection, especially for detecting COVID-19 rumors from Sina Weibo with only a minimal number of labeled instances. We contribute a Sina Weibo COVID-19 rumor dataset for few-shot rumor detection and propose a few-shot learning-based multi-modality fusion model for few-shot rumor detection. A full microblog consists of the source post and corresponding comments, which are considered as two modalities and fused with the meta-learning methods.

Results: Experiments of few-shot rumor detection on the collected Weibo dataset and the PHEME public dataset have shown significant improvement and generality of the proposed model.

Keywords: COVID-19; Few-shot learning; Multi-modality; Rumor detection; Social media.

Associated data

  • figshare/10.6084/m9.figshare.4010619.v1

Grants and funding

This work was supported by the National Natural Science Foundation of China (No.62002137, 61876072), the Fundamental Research Funds for the Central Universities (No. JUSRP12021), and the State Key Lab for Novel Software Technology, Nanjing University, P.R. China (No. KFKT2020B02). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.