The accuracy of race & ethnicity data in US based healthcare databases: A systematic review

Am J Surg. 2023 Oct;226(4):463-470. doi: 10.1016/j.amjsurg.2023.05.011. Epub 2023 May 18.

Abstract

Background: The availability and accuracy of data on a patient's race/ethnicity varies across databases. Discrepancies in data quality can negatively impact attempts to study health disparities.

Methods: We conducted a systematic review to organize information on the accuracy of race/ethnicity data stratified by database type and by specific race/ethnicity categories.

Results: The review included 43 studies. Disease registries showed consistently high levels of data completeness and accuracy. EHRs frequently showed incomplete and/or inaccurate data on the race/ethnicity of patients. Databases had high levels of accurate data for White and Black patients but relatively high levels of misclassification and incomplete data for Hispanic/Latinx patients. Asians, Pacific Islanders, and AI/ANs are the most misclassified. Systems-based interventions to increase self-reported data showed improvement in data quality.

Conclusion: Data on race/ethnicity that is collected with the purpose of research and quality improvement appears most reliable. Data accuracy can vary by race/ethnicity status and better collection standards are needed.

Keywords: Data accuracy; Database; Disparities; Health equity; Systematic review.

Publication types

  • Systematic Review

MeSH terms

  • American Indian or Alaska Native
  • Asian
  • Black or African American
  • Data Management* / organization & administration
  • Data Management* / standards
  • Data Management* / statistics & numerical data
  • Ethnicity* / statistics & numerical data
  • Healthcare Disparities / ethnology
  • Healthcare Disparities / standards
  • Healthcare Disparities / statistics & numerical data
  • Hispanic or Latino
  • Humans
  • Pacific Island People
  • Racial Groups* / ethnology
  • Racial Groups* / statistics & numerical data
  • White