Distantly supervised biomedical relation extraction using piecewise attentive convolutional neural network and reinforcement learning

Tiantian Zhu; Yang Qin; Yang Xiang; Baotian Hu; Qingcai Chen; Weihua Peng

doi:10.1093/jamia/ocab176

Distantly supervised biomedical relation extraction using piecewise attentive convolutional neural network and reinforcement learning

J Am Med Inform Assoc. 2021 Nov 25;28(12):2571-2581. doi: 10.1093/jamia/ocab176.

Authors

Tiantian Zhu^{1

2}, Yang Qin¹, Yang Xiang², Baotian Hu¹, Qingcai Chen^{1

2}, Weihua Peng³

Affiliations

¹ Department of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, China.
² Department of Network Intelligence, Peng Cheng Laboratory, Shenzhen, China.
³ Department of Knowledge Graph, Baidu International Technology (Shenzhen), Shenzhen, China.

Abstract

Objective: There have been various methods to deal with the erroneous training data in distantly supervised relation extraction (RE), however, their performance is still far from satisfaction. We aimed to deal with the insufficient modeling problem on instance-label correlations for predicting biomedical relations using deep learning and reinforcement learning.

Materials and methods: In this study, a new computational model called piecewise attentive convolutional neural network and reinforcement learning (PACNN+RL) was proposed to perform RE on distantly supervised data generated from Unified Medical Language System with MEDLINE abstracts and benchmark datasets. In PACNN+RL, PACNN was introduced to encode semantic information of biomedical text, and the RL method with memory backtracking mechanism was leveraged to alleviate the erroneous data issue. Extensive experiments were conducted on 4 biomedical RE tasks.

Results: The proposed PACNN+RL model achieved competitive performance on 8 biomedical corpora, outperforming most baseline systems. Specifically, PACNN+RL outperformed all baseline methods with the F1-score of 0.5592 on the may-prevent dataset, 0.6666 on the may-treat dataset, and 0.3838 on the DDI corpus, 2011. For the protein-protein interaction RE task, we obtained new state-of-the-art performance on 4 out of 5 benchmark datasets.

Conclusions: The performance on many distantly supervised biomedical RE tasks was substantially improved, primarily owing to the denoising effect of the proposed model. It is anticipated that PACNN+RL will become a useful tool for large-scale RE and other downstream tasks to facilitate biomedical knowledge acquisition. We also made the demonstration program and source code publicly available at http://112.74.48.115:9000/.

Keywords: biomedical relation extraction; deep learning; distant supervision; neural networks; reinforcement learning.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Neural Networks, Computer*
Semantics
Unified Medical Language System*