Geospatial divide in real-world EHR data: Analytical workflow to assess regional biases and potential impact on health equity

AMIA Jt Summits Transl Sci Proc. 2023 Jun 16:2023:572-581. eCollection 2023.

Abstract

Real-world data (RWD) like electronic health records (EHR) has great potential for secondary use by health systems and researchers. However, collected primarily for efficient health care, EHR data may not equitably represent local regions and populations, impacting the generalizability of insights learned from it. We assessed the geospatial representativeness of regions in a large health system EHR data using a spatial analysis workflow, which provides a data-driven way to quantify geospatial representation and identify adequately represented regions. We applied the workflow to investigate geospatial patterns of overweight/obesity and depression patients to find regional "hotspots" for potential targeted interventions. Our findings show the presence of geospatial bias in EHR and demonstrate the workflow to identify spatial clusters after adjusting for bias due to the geospatial representativeness. This work highlights the importance of evaluating geospatial representativeness in RWD to guide targeted deployment of limited healthcare resources and generate equitable real-world evidence.