Evaluating robustness of a generalized linear model when applied to electronic health record data accessed using an Open API

Health Informatics J. 2023 Apr-Jun;29(2):14604582231170892. doi: 10.1177/14604582231170892.

Abstract

The Integrated Clinical and Environmental Exposures Service (ICEES) provides open, regulatory-compliant access to clinical data, including electronic health record data, that have been integrated with environmental exposures data. While ICEES has been validated in the context of an asthma use case and several other use cases, the regulatory constraints on the ICEES open application programming interface (OpenAPI) result in data loss when the service is used for multivariate analysis. In this study, we investigated the robustness of the ICEES OpenAPI through a comparative analysis in which we applied a generalized linear model (GLM) to both the OpenAPI data and the constraint-free source data to examine factors predictive of asthma exacerbations. Consistent with previous studies, we found that the main predictors identified by both analyses were sex, prednisone, race, obesity, and airborne particulate exposure. Comparison of GLM fit revealed that data loss affects model quality, but only with select interaction terms. We conclude that the ICEES OpenAPI supports multivariate analysis, albeit with potential data loss that users should be aware of.
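
The comparative design described above can be illustrated with a minimal sketch. It is not the authors' code and does not use ICEES data: the data are synthetic, the binary covariates (sex, obesity), the coefficient values, and the function names are hypothetical, and random removal of 30% of records is only a stand-in for the constraint-driven data loss, whose actual mechanism is not described in the abstract. The sketch fits the same binomial GLM (logit link) to a full data set and to a reduced one, then compares the estimated coefficients and a size-normalized deviance.

```python
import math
import random


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x


def sigmoid(eta):
    return 1.0 / (1.0 + math.exp(-eta))


def fit_logistic_glm(X, y, iters=25):
    """Fit a binomial GLM with logit link by Newton-Raphson (equivalent to IRLS)."""
    p = len(X[0])
    beta = [0.0] * p
    for _ in range(iters):
        grad = [0.0] * p
        hess = [[0.0] * p for _ in range(p)]
        for xi, yi in zip(X, y):
            mu = sigmoid(sum(b * v for b, v in zip(beta, xi)))
            w = mu * (1.0 - mu)
            for j in range(p):
                grad[j] += (yi - mu) * xi[j]
                for k in range(p):
                    hess[j][k] += w * xi[j] * xi[k]
        step = solve(hess, grad)
        beta = [b + s for b, s in zip(beta, step)]
    return beta


def mean_deviance(X, y, beta):
    """Binomial deviance divided by sample size (lower means better fit)."""
    ll = 0.0
    for xi, yi in zip(X, y):
        mu = sigmoid(sum(b * v for b, v in zip(beta, xi)))
        ll += yi * math.log(mu) + (1.0 - yi) * math.log(1.0 - mu)
    return -2.0 * ll / len(y)


# Hypothetical "true" effects: intercept, sex, obesity (illustrative values only).
TRUE_BETA = [-1.0, 0.8, 0.5]

random.seed(0)
X_full, y_full = [], []
for _ in range(2000):
    xi = [1.0, float(random.random() < 0.5), float(random.random() < 0.3)]
    prob = sigmoid(sum(b * v for b, v in zip(TRUE_BETA, xi)))
    X_full.append(xi)
    y_full.append(1.0 if random.random() < prob else 0.0)

# Stand-in for data loss: randomly drop about 30% of records (an assumption).
keep = [i for i in range(len(y_full)) if random.random() > 0.3]
X_lossy = [X_full[i] for i in keep]
y_lossy = [y_full[i] for i in keep]

beta_full = fit_logistic_glm(X_full, y_full)
beta_lossy = fit_logistic_glm(X_lossy, y_lossy)

print("full :", [round(b, 3) for b in beta_full],
      "mean deviance", round(mean_deviance(X_full, y_full, beta_full), 4))
print("lossy:", [round(b, 3) for b in beta_lossy],
      "mean deviance", round(mean_deviance(X_lossy, y_lossy, beta_lossy), 4))
```

In this toy setting the two fits should yield similar coefficients, because records are dropped at random. Loss that depends on covariate combinations, as might plausibly arise with interaction terms, would be expected to distort the comparison more.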

Keywords: asthma; environmental exposures; generalized linear model; open application programming interface; open clinical data.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Asthma* / epidemiology
  • Electronic Health Records*
  • Environmental Exposure
  • Humans
  • Linear Models
  • Software