Evaluating sentence representations for biomedical text: Methods and experimental results

J Biomed Inform. 2020 Apr:104:103396. doi: 10.1016/j.jbi.2020.103396. Epub 2020 Mar 6.

Abstract

Text representations ar one of the main inputs to various Natural Language Processing (NLP) methods. Given the fast developmental pace of new sentence embedding methods, we argue that there is a need for a unified methodology to assess these different techniques in the biomedical domain. This work introduces a comprehensive evaluation of novel methods across ten medical classification tasks. The tasks cover a variety of BioNLP problems such as semantic similarity, question answering, citation sentiment analysis and others with binary and multi-class datasets. Our goal is to assess the transferability of different sentence representation schemes to the medical and clinical domain. Our analysis shows that embeddings based on Language Models which account for the context-dependent nature of words, usually outperform others in terms of performance. Nonetheless, there is no single embedding model that perfectly represents biomedical and clinical texts with consistent performance across all tasks. This illustrates the need for a more suitable bio-encoder. Our MedSentEval source code, pre-trained embeddings and examples have been made available on GitHub.

Keywords: BioNLP; Language model; Sentence embeddings; Text representation.

MeSH terms

  • Language*
  • Natural Language Processing*
  • Semantics
  • Software