Bio-semantic relation extraction with attention-based external knowledge reinforcement

BMC Bioinformatics. 2020 May 24;21(1):213. doi: 10.1186/s12859-020-3540-8.

Abstract

Background: Semantic resources such as knowledge bases contain high-quality, structured knowledge whose curation requires significant effort from domain experts. Using these resources to reinforce information retrieval from unstructured text may further exploit the potential of both the unstructured text resources and the curated knowledge.

Results: The paper proposes a novel method that uses a deep neural network model incorporating prior knowledge to improve performance in the automated extraction of biological semantic relations from the scientific literature. The model is based on a recurrent neural network that combines the attention mechanism with semantic resources, i.e., UniProt and BioModels. The method is evaluated on the BioNLP and BioCreative corpora, sets of manually annotated biological text. The experiments demonstrate that the method outperforms the current state-of-the-art models and that structured semantic information can improve the results of bio-text-mining.
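As a rough illustration of how an attention mechanism might weight external knowledge against a sentence representation, the following is a minimal sketch and not the authors' model. It assumes a recurrent hidden state and a set of knowledge-base entry embeddings of the same dimension; the function names, the dot-product scoring, and the concatenation step are all assumptions made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def knowledge_attention(hidden, knowledge):
    """Attend over knowledge embeddings using a hidden state as the query.

    hidden    : list[float], e.g. a recurrent network's hidden state (assumed)
    knowledge : list[list[float]], hypothetical embeddings of knowledge-base
                entries, each with the same length as `hidden`
    Returns the attention weights and the hidden state concatenated with
    the attention-weighted knowledge context vector.
    """
    # Dot-product score between the query and each knowledge embedding.
    scores = [sum(h * k for h, k in zip(hidden, vec)) for vec in knowledge]
    weights = softmax(scores)
    # Context vector: weighted sum of the knowledge embeddings.
    dim = len(hidden)
    context = [
        sum(w * vec[i] for w, vec in zip(weights, knowledge))
        for i in range(dim)
    ]
    # Concatenate so a downstream classifier sees both text and knowledge.
    return weights, hidden + context


# Toy example: the first knowledge entry aligns with the hidden state.
weights, combined = knowledge_attention(
    [1.0, 0.0],
    [[1.0, 0.0], [0.0, 1.0]],
)
```

In this toy example the first knowledge entry receives the larger weight because it points in the same direction as the hidden state; a trained model would typically learn the scoring function rather than use a raw dot product.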

Conclusion: The experimental results show that our approach can effectively make use of external prior knowledge and improve performance in the protein-protein interaction extraction task. Although it is validated on biomedical texts, the method should generalize to other types of data.

Keywords: Attention mechanism; Bio-text-mining; Biological semantic relation; Knowledge base.

MeSH terms

  • Algorithms*
  • Attention / physiology*
  • Databases, Genetic
  • Humans
  • Knowledge Bases*
  • Neural Networks, Computer
  • Publications
  • Semantics*