Automated Type 2 Diabetes Case and Control Identification from the MIMIC-IV Database

AMIA Jt Summits Transl Sci Proc. 2023 Jun 16:2023:602-611. eCollection 2023.

Abstract

Phenotyping for Type 2 Diabetes (T2DM) is needed due to the increasing demand for T2DM research on electronic health records (EHRs). eMERGE is a reliable and interpretable rule-based algorithm for the identification of T2DM cases and controls in EHRs. MIMIC-IV, an extension of MIMIC-III, contains more than 520,000 hospital admissions and has become a valuable EHR database for secondary medical research. However, there was no prior work to extract T2DM cases and controls from MIMIC-IV, which requires a comprehensive knowledge of the database. Our work provided insight into the structure and data elements in MIMIC-IV and adapted eMERGE to accomplish the task. The results included MIMIC-IV's data tables and elements used, 12,735 cases and 9,828 controls of T2DM, and summary statistics of the cohorts in comparison with those on other EHR databases. They could be used for the development of statistical and machine learning models in future studies about the disease.