Towards Chinese clinical named entity recognition by dynamic embedding using domain-specific knowledge

J Biomed Inform. 2020 Jun:106:103435. doi: 10.1016/j.jbi.2020.103435. Epub 2020 Apr 29.

Abstract

The task of electronic medical record named entity recognition (NER) refers to automatically identify all kinds of named entities in the medical record text. Chinese clinical NER remains a major challenge. One of the main reasons is that Chinese word segmentation will lead to the wrong downstream works. Besides, existing methods only use the information of the general field, not consider the knowledge from field of medicine. To address these issues, we propose a dynamic embedding method based on dynamic attention which combines features of both character and word in embedding layer. Domain knowledge is provided by word vector trained by domain dataset. In addition, spatial attention is added to enable the model to obtain more and more effective context encoding information. Finally, we conduct extensive experiments to demonstrate the effectiveness of our proposed algorithm. Experiments on CCKS2017 and Common dataset shows that the proposed method outperforms the baseline.

Keywords: Chinese electronic medical record; Domain-specific knowledge; Dynamic embedding; Named entity recognition.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Algorithms
  • Attention
  • China
  • Electronic Health Records*
  • Text Messaging*