Exploring YouTube's Recommendation System in the Context of COVID-19 Vaccines: Computational and Comparative Analysis of Video Trajectories

Yee Man Margaret Ng; Katherine Hoffmann Pham; Miguel Luengo-Oroz

doi:10.2196/49061

Exploring YouTube's Recommendation System in the Context of COVID-19 Vaccines: Computational and Comparative Analysis of Video Trajectories

J Med Internet Res. 2023 Sep 15:25:e49061. doi: 10.2196/49061.

Authors

Yee Man Margaret Ng^{1

2}, Katherine Hoffmann Pham¹, Miguel Luengo-Oroz^{1

3}

Affiliations

¹ UN Global Pulse, New York, NY, United States.
² Department of Journalism & Institute of Communications Research, University of Illinois at Urbana-Champaign, Champaign, IL, United States.
³ Biomedical Image Technologies, ETSI Telecomunicación, Universidad Politécnica de Madrid & CIBER-BBN, ISCIII, Madrid, Spain.

PMID: 37713243
PMCID: PMC10506664
DOI: 10.2196/49061

Abstract

Background: Throughout the COVID-19 pandemic, there has been a concern that social media may contribute to vaccine hesitancy due to the wide availability of antivaccine content on social media platforms. YouTube has stated its commitment to removing content that contains misinformation on vaccination. Nevertheless, such claims are difficult to audit. There is a need for more empirical research to evaluate the actual prevalence of antivaccine sentiment on the internet.

Objective: This study examines recommendations made by YouTube's algorithms in order to investigate whether the platform may facilitate the spread of antivaccine sentiment on the internet. We assess the prevalence of antivaccine sentiment in recommended videos and evaluate how real-world users' experiences are different from the personalized recommendations obtained by using synthetic data collection methods, which are often used to study YouTube's recommendation systems.

Methods: We trace trajectories from a credible seed video posted by the World Health Organization to antivaccine videos, following only video links suggested by YouTube's recommendation system. First, we gamify the process by asking real-world participants to intentionally find an antivaccine video with as few clicks as possible. Having collected crowdsourced trajectory data from respondents from (1) the World Health Organization and United Nations system (n_WHO/UN=33) and (2) Amazon Mechanical Turk (n_AMT=80), we next compare the recommendations seen by these users to recommended videos that are obtained from (3) the YouTube application programming interface's RelatedToVideoID parameter (n_RTV=40) and (4) from clean browsers without any identifying cookies (n_CB=40), which serve as reference points. We develop machine learning methods to classify antivaccine content at scale, enabling us to automatically evaluate 27,074 video recommendations made by YouTube.

Results: We found no evidence that YouTube promotes antivaccine content; the average share of antivaccine videos remained well below 6% at all steps in users' recommendation trajectories. However, the watch histories of users significantly affect video recommendations, suggesting that data from the application programming interface or from a clean browser do not offer an accurate picture of the recommendations that real users are seeing. Real users saw slightly more provaccine content as they advanced through their recommendation trajectories, whereas synthetic users were drawn toward irrelevant recommendations as they advanced. Rather than antivaccine content, videos recommended by YouTube are likely to contain health-related content that is not specifically related to vaccination. These videos are usually longer and contain more popular content.

Conclusions: Our findings suggest that the common perception that YouTube's recommendation system acts as a "rabbit hole" may be inaccurate and that YouTube may instead be following a "blockbuster" strategy that attempts to engage users by promoting other content that has been reliably successful across the platform.

Keywords: YouTube; algorithmic auditing; antivaccine sentiment; crowdsourcing; recommendation systems; watch history.

©Yee Man Margaret Ng, Katherine Hoffmann Pham, Miguel Luengo-Oroz. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 15.09.2023.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

COVID-19 Vaccines / therapeutic use
COVID-19* / prevention & control
Communications Media*
Humans
Pandemics / prevention & control
Social Media*

Substances

COVID-19 Vaccines