Methodology for Using Real-World Data From Electronic Health Records to Assess Chemotherapy Administration in Women With Breast Cancer

JCO Clin Cancer Inform. 2024 Apr:8:e2300209. doi: 10.1200/CCI.23.00209.

Abstract

Purpose: Identifying patients' intended chemotherapy regimens is critical to most research questions addressed in the real-world setting of cancer care. Yet these data are not routinely available in electronic health records (EHRs) at the specificity required to answer such questions. We developed a methodology to identify patients' intended regimens from EHR data in the Optimal Breast Cancer Chemotherapy Dosing (OBCD) study.

Methods: We categorized women older than 18 years who were diagnosed with primary stage I-IIIA breast cancer at Kaiser Permanente Northern California (2006-2019) into 24 drug combinations described in National Comprehensive Cancer Network guidelines for breast cancer treatment. Within these combinations, participants were further categorized into 50 guideline chemotherapy administration schedules using an iterative algorithm process, followed by chart abstraction where necessary. We also identified patients intended to receive nonguideline administration schedules within guideline drug combinations, as well as patients intended to receive nonguideline drug combinations. This process was adapted at Kaiser Permanente Washington using abstracted data (2004-2015).
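The first categorization step described above, matching the drugs a patient received to a guideline drug combination, can be pictured with a minimal sketch. Everything named here is an assumption for illustration: the lookup table holds only two widely used combinations (AC and TC) rather than the study's 24, and the function name and input format are hypothetical, not the OBCD algorithm itself.

```python
from typing import Iterable, Optional

# Illustrative lookup only: two well-known combinations, NOT the study's
# full set of 24 guideline drug combinations.
GUIDELINE_COMBINATIONS = {
    frozenset({"doxorubicin", "cyclophosphamide"}): "AC",
    frozenset({"docetaxel", "cyclophosphamide"}): "TC",
}


def classify_combination(administered_drugs: Iterable[str]) -> Optional[str]:
    """Return a guideline combination label, or None if no exact match.

    Hypothetical helper: drug names are normalized (trimmed, lowercased)
    and compared as an unordered set against the lookup table.
    """
    key = frozenset(name.strip().lower() for name in administered_drugs)
    return GUIDELINE_COMBINATIONS.get(key)


# A None result would flag the record for further review, for example as a
# possible nonguideline drug combination or a candidate for chart abstraction.
matched = classify_combination(["Doxorubicin", " cyclophosphamide "])
unmatched = classify_combination(["paclitaxel"])
```

Comparing unordered sets keeps the match independent of the order in which drugs appear in the record; assigning an administration schedule within a matched combination would require dose and date information that this sketch does not model.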

Results: In the OBCD cohort, 13,231 women received adjuvant or neoadjuvant chemotherapy, of whom 10,213 (77%) had their intended regimen identified via the algorithm, 2,416 (18%) had their intended regimen identified via abstraction, and 602 (4.5%) had an intended regimen that could not be identified. Across guideline drug combinations, 111 nonguideline dosing schedules were used, alongside 61 nonguideline drug combinations. Several factors were associated with requiring abstraction for regimen determination, including decreasing neighborhood household income, earlier diagnosis year, later stage, nodal status, and human epidermal growth factor receptor 2 (HER2)+ status.
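As a quick arithmetic check on the cohort breakdown reported above, the short sketch below recomputes each share from the stated counts. The variable names are illustrative; the counts are taken directly from the Results.

```python
# Counts as reported in the Results; dictionary keys are illustrative labels.
total = 13_231
counts = {
    "algorithm": 10_213,
    "abstraction": 2_416,
    "unidentified": 602,
}

# The three groups should partition the full chemotherapy cohort.
assert sum(counts.values()) == total

# Percentage of the cohort in each group.
shares = {group: 100 * n / total for group, n in counts.items()}

for group, pct in shares.items():
    print(f"{group}: {pct:.1f}%")
```

The three counts sum to the cohort total, and the computed shares round to the reported 77%, 18%, and 4.5%.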

Conclusion: We describe the challenges of, and approaches to, operationalizing complex real-world data to identify intended chemotherapy regimens in large observational studies. This methodology can make the use of large-scale clinical data in real-world populations more efficient, helping answer critical questions to improve care delivery and patient outcomes.

MeSH terms

  • Breast Neoplasms* / diagnosis
  • Breast Neoplasms* / drug therapy
  • Breast Neoplasms* / epidemiology
  • Drug Combinations
  • Electronic Health Records
  • Female
  • Humans

Substances

  • Drug Combinations