'Clustering' documents automatically to support scoping reviews of research: a case study

Res Synth Methods. 2013 Sep;4(3):230-41. doi: 10.1002/jrsm.1082. Epub 2013 Jul 3.

Abstract

Background: Scoping reviews of research help determine the feasibility and the resource requirements of conducting a systematic review, and the potential to generate a description of the literature quickly is attractive.

Aims: To test the utility and applicability of an automated clustering tool to describe and group research studies to improve the efficiency of scoping reviews.

Methods: A retrospective study of two completed scoping reviews was conducted. This compared the groups and descriptive categories obtained by automatically clustering titles and abstracts with those that had originally been derived using traditional researcher-driven techniques.

Results: The clustering tool rapidly categorised research into themes, which were useful in some instances, but not in others. This provided a dynamic means to view each dataset. Interpretation was challenging where there were potentially multiple meanings of terms. Where relevant clusters were unambiguous, there was a high precision of relevant studies, although recall varied widely.

Conclusions: Policy-relevant scoping reviews are often undertaken rapidly, and this could potentially be enhanced by automation depending on the nature of the dataset and information sought. However, it is not a replacement for researcher-developed classification. The possibilities of further applications and potential for use in other types of review are discussed.

Keywords: automatic clustering; automation; information storage and retrieval; methods, mapping; scoping reviews; text mining.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Data Mining / methods*
  • Documentation / classification*
  • Machine Learning
  • Natural Language Processing*
  • Periodicals as Topic / classification
  • Research Report*
  • Review Literature as Topic*
  • Vocabulary, Controlled*