Lab indicators standardization method for the regional healthcare platform: a case study on heart failure

Ming Liang; ZhiXing Zhang; JiaYing Zhang; Tong Ruan; Qi Ye; Ping He

doi:10.1186/s12911-020-01324-6

BMC Med Inform Decis Mak. 2020 Dec 15;20(Suppl 14):331. doi: 10.1186/s12911-020-01324-6.

Authors

Ming Liang¹, ZhiXing Zhang¹, JiaYing Zhang¹, Tong Ruan², Qi Ye¹, Ping He³

Affiliations

¹ School of Information Science and Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China.
² School of Information Science and Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai, 200237, China. ruantong@ecust.edu.cn.
³ Shanghai Hospital Development Center, 2 Kangding Road, Shanghai, 200000, China.

Abstract

Background: Laboratory indicator test results in electronic health records have been applied to many clinical big data analyses. However, the same laboratory examination item (i.e., lab indicator) is often presented under different Chinese names because of translation differences and the naming habits of individual hospitals, which distorts analysis results.

Methods: We propose a framework that combines a recall model with a binary classification model to reduce the alignment scale and improve the accuracy of lab indicator normalization. To reduce the alignment scale, TF-IDF is used for candidate selection. To ensure the accuracy of the output, we use the Enhanced Sequential Inference Model (ESIM) for binary classification. Active learning, with a newly proposed selection strategy, is applied to reduce annotation cost.
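
As an illustration of the candidate-selection (recall) step only, the following is a minimal sketch of TF-IDF retrieval over character n-grams, written with the Python standard library. It is not the authors' implementation: the function names, the use of character unigrams and bigrams, the smoothed IDF formula, the `top_k` parameter, and the sample indicator names are all assumptions made for the example.

```python
import math
from collections import Counter


def char_ngrams(name, n_values=(1, 2)):
    """Split a name into character n-grams (no word segmentation needed)."""
    return [name[i:i + n] for n in n_values for i in range(len(name) - n + 1)]


def build_idf(names):
    """Smoothed inverse document frequency over the standard names (assumed formula)."""
    doc_freq = Counter()
    for name in names:
        doc_freq.update(set(char_ngrams(name)))
    total = len(names)
    return {g: math.log((1 + total) / (1 + df)) + 1.0 for g, df in doc_freq.items()}


def tfidf_vector(name, idf):
    """L2-normalised sparse TF-IDF vector; n-grams unseen in the IDF table are dropped."""
    counts = Counter(g for g in char_ngrams(name) if g in idf)
    vec = {g: c * idf[g] for g, c in counts.items()}
    norm = math.sqrt(sum(v * v for v in vec.values()))
    return {g: v / norm for g, v in vec.items()} if norm else {}


def cosine(a, b):
    """Dot product of two L2-normalised sparse vectors."""
    return sum(w * b.get(g, 0.0) for g, w in a.items())


def recall_candidates(query, standard_names, top_k=3):
    """Return the top_k standard names most similar to a raw hospital name."""
    idf = build_idf(standard_names)
    q = tfidf_vector(query, idf)
    scored = [(cosine(q, tfidf_vector(s, idf)), s) for s in standard_names]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(s, score) for score, s in scored[:top_k]]


# Hypothetical standard vocabulary and raw hospital-side name.
STANDARD = ["血红蛋白", "白细胞计数", "血小板计数", "红细胞计数"]
for name, score in recall_candidates("血红蛋白测定", STANDARD, top_k=2):
    print(name, round(score, 3))
```

In a two-stage design like the one described, only the few candidates returned by this step would be passed to the binary classifier, so the classifier never has to score every raw name against every standard name.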

Results: Because our indicator standardization method focuses mainly on inconsistency among Chinese indicator names, we ran our experiments on data from the Shanghai Hospital Development Center (SHDC), selecting clinical data from 8 hospitals. The method achieves an F1-score of 92.08% in the final binary classification. For active learning, the proposed selection strategy performs better than a random baseline and, with only 43% of the training data, outperforms the model trained on the full data. A case study on heart failure clinical analysis, conducted on a sub-dataset collected from SHDC, shows that the proposed method is practical and performs well in application.

Conclusion: This work demonstrates that the proposed framework can be applied effectively to lab indicator normalization, and that active learning is suitable for reducing annotation cost in this task. The method is also valuable for data cleaning, data mining, text extraction and entity alignment.

Keywords: Active learning; Electronic health record; Entity alignment; Heart failure; Lab indicator standardization; Machine learning.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

China
Delivery of Health Care
Electronic Health Records*
Heart Failure* / diagnosis
Humans
Reference Standards