Retrospective study of propionic acidemia using natural language processing in Mayo Clinic electronic health record data

Mol Genet Metab. 2023 Nov;140(3):107695. doi: 10.1016/j.ymgme.2023.107695. Epub 2023 Sep 2.

Abstract

Background: Propionic acidemia (PA) is a rare autosomal recessive organic acidemia that classically presents within the first days of life with a metabolic crisis or via newborn screening and is confirmed with laboratory tests. Limited data exist on the natural history of patients with PA describing presentation, treatments, and clinical outcomes.

Objective: To retrospectively describe the natural history of patients with PA in a clinical setting from a real-world database using both structured and unstructured electronic health record (EHR) data using novel data extraction techniques in a unique care setting.

Design/methods: This retrospective study used EHR data to identify patients with PA seen at the Mayo Clinic. Unstructured clinical text (medical notes, pathology reports) were analyzed using augmented curation natural language processing models to enhance analysis of data extracted by structured data fields (International Classification of Diseases 9th or 10th revision [ICD-9/-10] codes, Current Procedural Terminology [CPT] codes, and medication orders). De-identified health records were also manually reviewed by clinical scientists to ensure data accuracy and completeness. The index date was defined as the patient's date of PA diagnosis at the Mayo Clinic. Results were reported as aggregate descriptive statistics relative to patients' index dates. Complications, therapeutic interventions, laboratory tests, procedures, and hospitalization encounters related to PA were described at and within 6 months of the patient's index date, and from medical history available before the index date.

Results: In total, 13 patients with PA were identified, with visits occurring from 1998 to 2022. Age at diagnosis ranged from birth to 3 years; age at initial evaluation at the Mayo Clinic ranged from 3 days to 28 years. The mean number of Mayo Clinic outpatient visits was 31 (median duration of care, 2 years). PA-related complications were documented in 85% of patients and included nutritional difficulties (46%), metabolic decompensation events (MDEs; 38%), neurologic abnormalities (38%), and cardiomyopathy (7%). One pair of affected siblings had mild symptoms and no complications or MDEs. All 5 patients with a history of MDEs presented with developmental delays. Among patients with MDEs, the mean frequency of outpatient clinical care visits was 10 per year, and 3 patients required inpatient hospitalization (mean duration, 16 days). The incidence of severe complications was higher among patients with MDEs than those without MDEs. Of the patients with MDEs, 2 experienced crises while receiving treatment at the Mayo Clinic, with 9 total MDEs occurring between the 2 patients. Symptoms at presentation included hyperammonemia (78%), fever and/or decreased nutritional intake (67%), hyperglycemia/hypoglycemia (56%), intercurrent upper respiratory infection and/or lethargy (44%), constipation (33%), altered mental status (33%), and cough (33%).

Conclusions: This study highlights the range and frequency of clinical outcomes experienced by patients with PA and demonstrates the clinical burden of MDEs.

Keywords: Augmented curation model; Electronic health records; Metabolic decompensation events; Natural language processing; Propionic acidemia; Real-world data.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Child, Preschool
  • Electronic Health Records
  • Humans
  • Infant, Newborn
  • Natural Language Processing
  • Neonatal Screening / methods
  • Propionic Acidemia* / complications
  • Propionic Acidemia* / diagnosis
  • Propionic Acidemia* / epidemiology
  • Retrospective Studies