Adversarial training based lattice LSTM for Chinese clinical named entity recognition

Shan Zhao; Zhiping Cai; Haiwen Chen; Ye Wang; Fang Liu; Anfeng Liu

doi:10.1016/j.jbi.2019.103290

Adversarial training based lattice LSTM for Chinese clinical named entity recognition

J Biomed Inform. 2019 Nov:99:103290. doi: 10.1016/j.jbi.2019.103290. Epub 2019 Sep 23.

Authors

Shan Zhao¹, Zhiping Cai², Haiwen Chen¹, Ye Wang¹, Fang Liu³, Anfeng Liu⁴

Affiliations

¹ College of Computer, National University of Defense Technology, Changsha, China.
² College of Computer, National University of Defense Technology, Changsha, China. Electronic address: zpcai@nudt.edu.cn.
³ School of Data and Computer Science, Sun Yat-sen University, Guangzhou, China. Electronic address: liufang@nudt.edu.cn.
⁴ School of Computer Science and Engineering, Central South University, Changsha, China.

PMID: 31557528
DOI: 10.1016/j.jbi.2019.103290

Abstract

Clinical named entity recognition (CNER), which intends to automatically detect clinical entities in electronic health record (EHR), is a committed step for further clinical text mining. Recently, more and more deep learning models are used to Chinese CNER. However, these models do not make full use of the information in EHR, for these models are either word-based or character-based. In addition, neural models tend to be locally unstable and even tiny perturbation may mislead them. In this paper, we firstly propose a novel adversarial training based lattice LSTM with a conditional random field layer (AT-lattice LSTM-CRF) for Chinese CNER. Lattice LSTM is used to capture richer information in EHR. As a powerful regularization method, AT can be used to improve the robustness of neural models by adding perturbations to the training data. Then, we conduct experiments on the proposed neural model with dataset of CCKS-2017 Task 2. The results show that the proposed model achieves a highly competitive performance (with an F1 score of 89.64%) compared to other prevalent neural models, which can be a reinforced baseline for further research in this field.

Keywords: Adversarial training; CRF; Clinical named entity recognition; Lattice LSTM.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

China
Cluster Analysis
Data Mining / methods*
Deep Learning*
Electronic Health Records*
Humans
Language
Medical Informatics
Neural Networks, Computer
Pattern Recognition, Automated