The Belt and Road Initiative on Twitter: An annotated dataset

Data Brief. 2022 Nov 1:45:108711. doi: 10.1016/j.dib.2022.108711. eCollection 2022 Dec.

Abstract

Initiated by the Chinese president Xi Jinping in 2013, the Belt and Road initiative (BRI) is a multi-trillion-dollar agenda for facilitating trade and investment, especially massive infrastructural developments. In recent years, discussions around the BRI have been increasing as more than 130 countries and 30 international organizations have officially joined the initiative [1], collaborating in a series of transnational infrastructure projects funded by Chinese companies or the Chinese state. This dataset provides 500,711 posts and 714,794 reposting threads related to the BRI on Twitter. The dataset was collected through the Twitter API by applying a set of keywords: "belt and road", "one belt one road", "new silk road", "maritime silk road", and "silk road economic belt", which included the words and their hashtag forms to download the raw data from Twitter. The time series of the dataset is from 7 September 2013 to 30 November 2021. Furthermore, the dataset is annotated in terms of languages, emotional polarity, geopolitical entities, and credibility by employing textual analytics in language detection, neural machine translation, and lexicon-based sentiment analysis. To facilitate future research, we classified the dataset into three databases that can be analyzed separately and reused in research related to various fields, such as political science, network science, and sociology to study public opinions about the BRI and their dissemination patterns.

Keywords: Belt and Road Initiative; Geopolitics; Retweets; Social media; Social networks; Tweets; Twitter.