Supporting the Diagnosis of Fabry Disease Using a Natural Language Processing-Based Approach

J Clin Med. 2023 May 22;12(10):3599. doi: 10.3390/jcm12103599.

Abstract

In clinical practice, the consideration of non-specific symptoms of rare diseases in order to make a correct and timely diagnosis is often challenging. To support physicians, we developed a decision-support scoring system on the basis of retrospective research. Based on the literature and expert knowledge, we identified clinical features typical for Fabry disease (FD). Natural language processing (NLP) was used to evaluate patients' electronic health records (EHRs) to obtain detailed information about FD-specific patient characteristics. The NLP-determined elements, laboratory test results, and ICD-10 codes were transformed and grouped into pre-defined FD-specific clinical features that were scored in the context of their significance in the FD signs. The sum of clinical feature scores constituted the FD risk score. Then, medical records of patients with the highest FD risk score were reviewed by physicians who decided whether to refer a patient for additional tests or not. One patient who obtained a high-FD risk score was referred for DBS assay and confirmed to have FD. The presented NLP-based, decision-support scoring system achieved AUC of 0.998, which demonstrates that the applied approach enables for accurate identification of FD-suspected patients, with a high discrimination power.

Keywords: EHR; Fabry disease; NLP; clinical diagnosis support system; decision-support; electronic health record; natural language processing; rare disease; risk factor.

Grants and funding

The source of funding did not affect the data collection, analysis, or the results described in this manuscript. The authors confirm that they had full access to all of the data in this study and take complete responsibility for the integrity of the data and the accuracy of the data analysis.