Named entity recognition for Chinese based on global pointer and adversarial training

Hongjun Li; Mingzhe Cheng; Zelin Yang; Liqun Yang; Yansong Chua

doi:10.1038/s41598-023-30355-y

Named entity recognition for Chinese based on global pointer and adversarial training

Sci Rep. 2023 Feb 24;13(1):3242. doi: 10.1038/s41598-023-30355-y.

Authors

Hongjun Li¹, Mingzhe Cheng², Zelin Yang³, Liqun Yang⁴, Yansong Chua⁵

Affiliations

¹ Key Laboratory of Deep-time Geography and Environment Reconstruction and Applications, MNR & College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu, 610059, China.
² College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu, 610059, China. cmz1@qq.com.
³ College of Computer Science and Cyber Security, Chengdu University of Technology, Chengdu, 610059, China.
⁴ China Nanhu Academy of Electronics and Information Technology CNAEIT, Jiaxing, 314001, China. yanglq@cnaeit.com.
⁵ China Nanhu Academy of Electronics and Information Technology CNAEIT, Jiaxing, 314001, China.

Abstract

Named entity recognition aims to identify entities from unstructured text and is an important subtask for natural language processing and building knowledge graphs. Most of the existing entity recognition methods use conditional random fields as label decoders or use pointer networks for entity recognition. However, when the number of tags is large, the computational cost of method based on conditional random fields is high and the problem of nested entities cannot be solved. The pointer network uses two modules to identify the first and the last of the entities separately, and a single module can only focus on the information of the first or the last of the entities, but cannot pay attention to the global information of the entities. In addition, the neural network model has the problem of local instability. To solve mentioned problems, a named entity recognition model based on global pointer and adversarial training is proposed. To obtain global entity information, global pointer is used to decode entity information, and rotary relative position information is considered in the model designing to improve the model's perception of position; to solve the model's local instability problem, adversarial training is used to improve the robustness and generalization of the model. The experimental results show that the F1 score of the model are improved on several public datasets of OntoNotes5, MSRA, Resume, and Weibo compared with the existing mainstream models.

Abstract

Grants and funding