An Overview of Biomolecular Event Extraction from Scientific Documents

Comput Math Methods Med. 2015:2015:571381. doi: 10.1155/2015/571381. Epub 2015 Oct 26.

Abstract

This paper presents a review of state-of-the-art approaches to automatic extraction of biomolecular events from scientific texts. Events involving biomolecules such as genes, transcription factors, or enzymes, for example, have a central role in biological processes and functions and provide valuable information for describing physiological and pathogenesis mechanisms. Event extraction from biomedical literature has a broad range of applications, including support for information retrieval, knowledge summarization, and information extraction and discovery. However, automatic event extraction is a challenging task due to the ambiguity and diversity of natural language and higher-level linguistic phenomena, such as speculations and negations, which occur in biological texts and can lead to misunderstanding or incorrect interpretation. Many strategies have been proposed in the last decade, originating from different research areas such as natural language processing, machine learning, and statistics. This review summarizes the most representative approaches in biomolecular event extraction and presents an analysis of the current state of the art and of commonly used methods, features, and tools. Finally, current research trends and future perspectives are also discussed.

Publication types

  • Review

MeSH terms

  • Animals
  • Computational Biology / methods*
  • Data Mining / methods*
  • Databases, Factual
  • Humans
  • Machine Learning
  • Natural Language Processing
  • Systems Biology