Bring Your Own Location Data: Use of Google Smartphone Location History Data for Environmental Health Research

Environ Health Perspect. 2022 Nov;130(11):117005. doi: 10.1289/EHP10829. Epub 2022 Nov 10.

Abstract

Background: Environmental exposures are commonly estimated using spatial methods, with most epidemiological studies relying on home addresses. Passively collected smartphone location data, such as Google Location History (GLH) data, may present an opportunity to integrate existing long-term time-activity data.

Objectives: We aimed to evaluate the potential use of GLH data for capturing long-term retrospective time-activity data for environmental health research.

Methods: We included 378 individuals who participated in previous Global Positioning System (GPS) studies within the Washington State Twin Registry. GLH data consist of location information that has been routinely collected since 2010 when location sharing was enabled within Android operating systems or Google apps. We created instructions for participants to download their GLH data and provide it through secure data transfer. We summarized the GLH data provided, compared it to available GPS data, and conducted an exposure assessment for nitrogen dioxide (NO2) air pollution.
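The summarization step described in the Methods could look roughly like the sketch below. It is illustrative only and is not the authors' code. It assumes the participant's export has already been parsed from JSON into a dictionary with a `locations` list whose records carry `latitudeE7`, `longitudeE7`, and either `timestampMs` or `timestamp` fields; those field names are an assumption about the export layout and are not stated in the abstract. The definition of a "unique location" (coordinates rounded to three decimal places, roughly 100 m) is likewise a hypothetical choice.

```python
from datetime import datetime, timezone


def parse_glh_records(export):
    """Turn a parsed GLH export (a dict) into (datetime, lat, lon) tuples.

    Field names are assumptions about the export layout, not taken
    from the abstract.
    """
    points = []
    for rec in export.get("locations", []):
        # Skip records that lack coordinates.
        if "latitudeE7" not in rec or "longitudeE7" not in rec:
            continue
        # Coordinates are assumed to be stored as integer degrees * 1e7.
        lat = rec["latitudeE7"] / 1e7
        lon = rec["longitudeE7"] / 1e7
        if "timestampMs" in rec:
            # Assumed older layout: epoch milliseconds as a string.
            ts = datetime.fromtimestamp(
                int(rec["timestampMs"]) / 1000, tz=timezone.utc
            )
        elif "timestamp" in rec:
            # Assumed newer layout: ISO 8601 string ending in "Z".
            ts = datetime.fromisoformat(rec["timestamp"].replace("Z", "+00:00"))
        else:
            continue
        points.append((ts, lat, lon))
    return points


def summarize(points, decimals=3):
    """Count points, distinct UTC calendar days, and distinct rounded locations.

    Rounding to `decimals` places is a hypothetical stand-in for
    whatever definition of "unique location" a study adopts.
    """
    days = {ts.date() for ts, _, _ in points}
    unique_locations = {
        (round(lat, decimals), round(lon, decimals)) for _, lat, lon in points
    }
    return {
        "n_points": len(points),
        "n_days": len(days),
        "n_unique_locations": len(unique_locations),
    }
```

Days are counted in UTC here for simplicity; a real analysis would likely convert timestamps to each participant's local time zone before counting participant-days.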

Results: Of 378 individuals contacted, we received GLH data from 61 individuals (16.1%), and 53 (14.0%) indicated interest but did not have historical GLH data available. The provided GLH data spanned 2010-2021 and included 34 million locations, capturing 66,677 participant-days. The median number of days with GLH data per participant was 752, capturing 442 unique locations. When we compared GLH data to 2-wk GPS data (1.8 million points), 95% of GPS time-activity points were within 100 m of GLH locations. We observed important differences between NO2 exposures assigned at home locations compared with GLH locations, highlighting the importance of GLH data to environmental exposure assessment.
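One way to compute an agreement figure such as "the share of GPS points within 100 m of a GLH location" is sketched below. It is a minimal illustration, not the authors' procedure: the abstract does not say how distances were calculated or whether points were also matched in time, so the haversine great-circle distance and the purely spatial matching used here are assumptions, as are the function names.

```python
import math

EARTH_RADIUS_M = 6371000.0  # mean Earth radius in meters


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (
        math.sin(dphi / 2) ** 2
        + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    )
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))


def fraction_within(gps_points, glh_points, threshold_m=100.0):
    """Share of GPS points with at least one GLH point within threshold_m.

    Both inputs are sequences of (lat, lon) pairs. Matching is spatial
    only (an assumption); no time alignment is attempted.
    """
    if not gps_points:
        return float("nan")
    hits = 0
    for g_lat, g_lon in gps_points:
        if any(
            haversine_m(g_lat, g_lon, lat, lon) <= threshold_m
            for lat, lon in glh_points
        ):
            hits += 1
    return hits / len(gps_points)
```

This brute-force comparison scales with the product of the two point counts. For data sets on the order of millions of points, a spatial index or gridding step would be needed in practice.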

Discussion: We believe collecting GLH data is a feasible and cost-effective method for capturing retrospective time-activity patterns for large populations that presents new opportunities for environmental epidemiology. Cohort studies should consider adding GLH data collection to capture historical time-activity patterns of participants, employing a "bring-your-own-location-data" citizen science approach. Privacy remains a concern that needs to be carefully managed when using GLH data. https://doi.org/10.1289/EHP10829.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Air Pollutants* / analysis
  • Air Pollution*
  • Environmental Exposure
  • Environmental Health
  • Humans
  • Retrospective Studies
  • Search Engine
  • Smartphone

Substances

  • Air Pollutants