MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records

Haoze Du; Jiahao Xu; Zhiyong Du; Lihui Chen; Shaohui Ma; Dongqing Wei; Xianfang Wang

doi:10.1007/s12539-024-00624-z

MF-MNER: Multi-models Fusion for MNER in Chinese Clinical Electronic Medical Records

Interdiscip Sci. 2024 Apr 5. doi: 10.1007/s12539-024-00624-z. Online ahead of print.

Authors

Haoze Du¹, Jiahao Xu², Zhiyong Du³, Lihui Chen², Shaohui Ma³, Dongqing Wei^{4

5

6}, Xianfang Wang⁷

Affiliations

¹ Department of Computer Science, North Carolina State University, Raleigh, NC, 27695, USA.
² School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, 639798, Singapore.
³ School of Computer Science and Technology, Henan Institute of Technology, Xinxiang, 453003, China.
⁴ State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiaotong University, Shanghai, 200240, China. dqwei@sjtu.edu.cn.
⁵ Joint Laboratory of International Cooperation in Metabolic and Developmental Sciences, Ministry of Education, Shanghai Jiaotong University, Shanghai, 200240, China. dqwei@sjtu.edu.cn.
⁶ Zhongjing Research and Industrialization, Institute of Chinese Medicine, Zhongguancun Scientific Park, Nanyang, 473000, China. dqwei@sjtu.edu.cn.
⁷ School of Computer Science and Technology, Henan Institute of Technology, Xinxiang, 453003, China. 2wangfang@163.com.

PMID: 38578388
DOI: 10.1007/s12539-024-00624-z

Abstract

To address the problem of poor entity recognition performance caused by the lack of Chinese annotation in clinical electronic medical records, this paper proposes a multi-medical entity recognition method F-MNER using a fusion technique combining BART, Bi-LSTM, and CRF. First, after cleaning, encoding, and segmenting the electronic medical records, the obtained semantic representations are dynamically fused using a bidirectional autoregressive transformer (BART) model. Then, sequential information is captured using a bidirectional long short-term memory (Bi-LSTM) network. Finally, the conditional random field (CRF) is used to decode and output multi-task entity recognition. Experiments are performed on the CCKS2019 dataset, with micro avg Precision, macro avg Recall, weighted avg Precision reaching 0.880, 0.887, and 0.883, and micro avg F1-score, macro avg F1-score, weighted avg F1-score reaching 0.875, 0.876, and 0.876 respectively. Compared with existing models, our method outperforms the existing literature in three evaluation metrics (micro average, macro average, weighted average) under the same dataset conditions. In the case of weighted average, the Precision, Recall, and F1-score are 19.64%, 15.67%, and 17.58% higher than the existing BERT-BiLSTM-CRF model respectively. Experiments are performed on the actual clinical dataset with our MF-MNER, the Precision, Recall, and F1-score are 0.638, 0.825, and 0.719 under the micro-avg evaluation mechanism. The Precision, Recall, and F1-score are 0.685, 0.800, and 0.733 under the macro-avg evaluation mechanism. The Precision, Recall, and F1-score are 0.647, 0.825, and 0.722 under the weighted avg evaluation mechanism. The above results show that our method MF-MNER can integrate the advantages of BART, Bi-LSTM, and CRF layers, significantly improving the performance of downstream named entity recognition tasks with a small amount of annotation, and achieving excellent performance in terms of recall score, which has certain practical significance. Source code and datasets to reproduce the results in this paper are available at https://github.com/xfwang1969/MF-MNER .

Keywords: BART; Bi-LSTM; CRF; MNER; Multi-models fusion.

Abstract

Grants and funding