Semantic Changepoint Detection for Finding Potentially Novel Research Publications

Pac Symp Biocomput. 2021;26:107-118.

Abstract

How has the focus of research papers on a given disease changed over time? Identifying the papers at the cusps of change can help highlight the emergence of a new topic or a shift in the direction of research. We present a generally applicable unsupervised approach to this question, based on detecting semantic changepoints within a given collection of research papers. We illustrate the approach with a range of examples drawn from the nascent literature on COVID-19 and from subsets of PubMed papers on the diseases in the World Health Organization list of neglected tropical diseases. The software is freely available at: https://github.com/pdddinakar/SemanticChangepointDetection.
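To make the idea of a semantic changepoint concrete, the following is a minimal illustrative sketch and not the authors' implementation, which is available at the repository above. It assumes each paper has already been reduced to a numeric embedding vector and that the papers are sorted by publication date. It then looks for the single split that maximizes the cosine distance between the mean embedding of the earlier papers and that of the later ones. The function names, the `min_segment` parameter, and the toy two-dimensional vectors are all hypothetical.

```python
from math import sqrt


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def centroid(vectors):
    """Return the component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def best_changepoint(vectors, min_segment=2):
    """Find the split that best separates earlier from later papers.

    `vectors` must be in chronological order. Returns (index, score),
    where `index` is the position of the first paper in the later
    segment and `score` is the cosine distance between the two segment
    centroids. Returns (None, -1.0) if the list is too short to split.
    """
    best_index, best_score = None, -1.0
    for k in range(min_segment, len(vectors) - min_segment + 1):
        score = cosine_distance(centroid(vectors[:k]), centroid(vectors[k:]))
        if score > best_score:
            best_index, best_score = k, score
    return best_index, best_score


# Toy data (hypothetical): three papers near one topic, then three near another.
papers = [
    [1.0, 0.1],
    [0.9, 0.2],
    [1.0, 0.0],
    [0.1, 1.0],
    [0.2, 0.9],
    [0.0, 1.0],
]

index, score = best_changepoint(papers)
print(index, round(score, 3))
```

On this toy data the selected split falls at index 3, the first paper of the second topic. A paper at such an index is a candidate for the "cusp of change" described above; a real corpus would likely call for multiple changepoints and a significance check, which this sketch omits.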

MeSH terms

  • COVID-19*
  • Computational Biology
  • Humans
  • PubMed
  • SARS-CoV-2
  • Semantics*