Twitter-based measures of neighborhood sentiment as predictors of residential population health

Joseph Gibbons; Robert Malouf; Brian Spitzberg; Lourdes Martinez; Bruce Appleyard; Caroline Thompson; Atsushi Nara; Ming-Hsiang Tsou

doi:10.1371/journal.pone.0219550

Twitter-based measures of neighborhood sentiment as predictors of residential population health

PLoS One. 2019 Jul 11;14(7):e0219550. doi: 10.1371/journal.pone.0219550. eCollection 2019.

Authors

Joseph Gibbons¹, Robert Malouf², Brian Spitzberg³, Lourdes Martinez³, Bruce Appleyard⁴, Caroline Thompson⁵, Atsushi Nara⁶, Ming-Hsiang Tsou⁶

Affiliations

¹ Department of Sociology, San Diego State University, San Diego, California, United States of America.
² Department of Linguistics and Asian/Middle Eastern Languages, San Diego State University, San Diego, California, United States of America.
³ School of Communication, San Diego State University, San Diego, California, United States of America.
⁴ School of Public Affairs and Fine Arts, San Diego State University, San Diego, California, United States of America.
⁵ School of Public Health, San Diego State University, San Diego, California, United States of America.
⁶ Department of Geography, San Diego State University, San Diego, California, United States of America.

Abstract

Several studies have recently applied sentiment-based lexicons to Twitter to gauge local sentiment to understand health behaviors and outcomes for local areas. While this research has demonstrated the vast potential of this approach, lingering questions remain regarding the validity of Twitter mining and surveillance in local health research. First, how well does this approach predict health outcomes at very local scales, such as neighborhoods? Second, how robust are the findings garnered from sentiment signals when accounting for spatial effects? To evaluate these questions, we link 2,076,025 tweets from 66,219 distinct users in the city of San Diego over the period of 2014-12-06 to 2017-05-24 to the 500 Cities Project data and 2010-2014 American Community Survey data. We determine how well sentiment predicts self-rated mental health, sleep quality, and heart disease at a census tract level, controlling for neighborhood characteristics and spatial autocorrelation. We find that sentiment is related to some outcomes on its own, but these relationships are not present when controlling for other neighborhood factors. Evaluating our encoding strategy more closely, we discuss the limitations of existing measures of neighborhood sentiment, calling for more attention to how race/ethnicity and socio-economic status play into inferences drawn from such measures.

Publication types

Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Cardiovascular Diseases / epidemiology*
Censuses
Cities
Ethnicity
Happiness
Humans
Mental Health*
Population Health*
Public Opinion
Semantics
Social Media*
United States / epidemiology

Grants and funding

U54 MD012397/MD/NIMHD NIH HHS/United States