Extracting Chinese events with a joint label space model

Wenzhi Huang; Junchi Zhang; Donghong Ji

doi:10.1371/journal.pone.0272353

Extracting Chinese events with a joint label space model

PLoS One. 2022 Sep 27;17(9):e0272353. doi: 10.1371/journal.pone.0272353. eCollection 2022.

Authors

Wenzhi Huang^{1

2}, Junchi Zhang², Donghong Ji¹

Affiliations

¹ Key Laboratory of Aerospace Information Security and Trusted Computing, Ministry of Education, School of Cyber Science and Engineering, Wuhan University, Wuhan, China.
² School of Computer Science and Engineering, Wuhan Institute of Technology, Wuhan, Hubei, China.

Abstract

The task of event extraction consists of three subtasks namely entity recognition, trigger identification and argument role classification. Recent work tackles these subtasks jointly with the method of multi-task learning for better extraction performance. Despite being effective, existing attempts typically treat labels of event subtasks as uninformative and independent one-hot vectors, ignoring the potential loss of useful label information, thereby making it difficult for these models to incorporate interactive features on the label level. In this paper, we propose a joint label space framework to improve Chinese event extraction. Specifically, the model converts labels of all subtasks into a dense matrix, giving each Chinese character a shared label distribution via an incrementally refined attention mechanism. Then the learned label embeddings are also used as the weight of the output layer for each subtask, hence adjusted along with model training. In addition, we incorporate the word lexicon into the character representation in a soft probabilistic manner, hence alleviating the impact of word segmentation errors. Extensive experiments on Chinese and English benchmarks demonstrate that our model outperforms state-of-the-art methods.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

China
Machine Learning*
Space Simulation*

Grants and funding

This work is supported by the Natural Science Foundation of China (No. 62106179).