Develop and Validate a Computable Phenotype for the Identification of Alzheimer's Disease Patients Using Electronic Health Record Data

medRxiv [Preprint]. 2024 Feb 6:2024.02.06.24302389. doi: 10.1101/2024.02.06.24302389.

Abstract

Introduction: Patients with Alzheimer's disease (AD) are often misclassified in electronic health records (EHRs) when identification relies solely on diagnostic codes. This study aims to develop a more accurate computable phenotype (CP) for identifying AD patients using both structured and unstructured EHR data.

Methods: We used EHRs from the University of Florida Health (UF Health) system and created rule-based CPs iteratively through manual chart reviews. The CPs were then validated using data from the University of Texas Health Science Center at Houston (UT Health) and the University of Minnesota (UMN).

Results: Our best-performing CP is "patient has at least 2 AD diagnoses and AD-related keywords", with an F1-score of 0.817 at UF Health, and 0.961 and 0.623 at UT Health and UMN, respectively.
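
The rule quoted above can be illustrated with a minimal Python sketch. The abstract does not list the diagnosis codes or keywords actually used, so everything here is an assumption made for illustration: the ICD-10-CM G30.x and ICD-9-CM 331.0 codes standing in for "AD diagnosis", the single keyword pattern, and the names Patient, meets_phenotype, and f1_score. It is not the study's implementation.

```python
import re
from dataclasses import dataclass, field

# Assumption: G30.x (ICD-10-CM) and 331.0 (ICD-9-CM) stand in for "AD diagnosis".
# The study's actual code list is not given in the abstract.
AD_CODE_PATTERN = re.compile(r"^(G30(\.\d+)?|331\.0)$")

# Assumption: a single illustrative keyword; the study's keyword list is not given.
AD_KEYWORD_PATTERN = re.compile(r"\balzheimer", re.IGNORECASE)


@dataclass
class Patient:
    """Hypothetical container for one patient's structured and unstructured data."""
    diagnosis_codes: list = field(default_factory=list)  # one entry per recorded diagnosis
    notes: list = field(default_factory=list)            # free-text clinical notes


def meets_phenotype(patient, min_diagnoses=2):
    """Return True if the patient has >= min_diagnoses AD codes and an AD keyword in any note."""
    n_ad = sum(1 for code in patient.diagnosis_codes if AD_CODE_PATTERN.match(code))
    has_keyword = any(AD_KEYWORD_PATTERN.search(note) for note in patient.notes)
    return n_ad >= min_diagnoses and has_keyword


def f1_score(predicted, actual):
    """F1 = harmonic mean of precision and recall, from paired boolean labels."""
    tp = sum(1 for p, a in zip(predicted, actual) if p and a)
    fp = sum(1 for p, a in zip(predicted, actual) if p and not a)
    fn = sum(1 for p, a in zip(predicted, actual) if not p and a)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy usage with made-up patients; the chart-review labels are invented for the example.
patients = [
    Patient(["G30.9", "G30.9"], ["History of Alzheimer's disease."]),
    Patient(["G30.9"], ["Possible Alzheimer's dementia."]),
    Patient(["G30.9", "331.0"], ["Memory complaints, work-up pending."]),
]
predicted = [meets_phenotype(p) for p in patients]
chart_review = [True, True, False]
score = f1_score(predicted, chart_review)
```

Requiring both signals trades recall for precision: a patient with a single coded diagnosis, or with codes but no supporting mention in the notes, is not counted as a case under this rule.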

Discussion: We developed and validated rule-based CPs that identify AD patients with good performance. Such CPs are crucial for studies that rely on real-world data such as EHRs.

Publication types

  • Preprint