Risks and Opportunities to Ensure Equity in the Application of Big Data Research in Public Health

Annu Rev Public Health. 2022 Apr 5:43:59-78. doi: 10.1146/annurev-publhealth-051920-110928. Epub 2021 Dec 6.

Abstract

The big data revolution presents an exciting frontier to expand public health research, broadening the scope of research and increasing the precision of answers. Despite these advances, scientists must remain vigilant that big data applications do not also advance harms toward marginalized communities. In this review, we provide examples in which big data applications have (unintentionally) perpetuated discriminatory practices, while also highlighting opportunities for big data applications to advance equity in public health. Here, big data is framed in the context of the five Vs (volume, velocity, veracity, variety, and value), and we propose a sixth V, virtuosity, which incorporates equity and justice frameworks. Analytic approaches to improving equity are presented using social computational big data, fairness in machine learning algorithms, medical claims data, and data augmentation as illustrations. Throughout, we emphasize the biasing influence of data absenteeism and positionality and conclude with recommendations for incorporating an equity lens into big data research.

Keywords: health equity; machine learning; multilevel models; multiple systems estimation.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms
  • Bias
  • Big Data*
  • Humans
  • Machine Learning
  • Public Health*