Natural Language Processing Technologies in Radiology Research and Clinical Applications

Radiographics. 2016 Jan-Feb;36(1):176-91. doi: 10.1148/rg.2016150080.

Abstract

The migration of imaging reports to electronic medical record systems holds great potential in terms of advancing radiology research and practice by leveraging the large volume of data continuously being updated, integrated, and shared. However, there are significant challenges as well, largely due to the heterogeneity of how these data are formatted. Indeed, although there is movement toward structured reporting in radiology (ie, hierarchically itemized reporting with use of standardized terminology), the majority of radiology reports remain unstructured and use free-form language. To effectively "mine" these large datasets for hypothesis testing, a robust strategy for extracting the necessary information is needed. Manual extraction of information is a time-consuming and often unmanageable task. "Intelligent" search engines that instead rely on natural language processing (NLP), a computer-based approach to analyzing free-form text or speech, can be used to automate this data mining task. The overall goal of NLP is to translate natural human language into a structured format (ie, a fixed collection of elements), each with a standardized set of choices for its value, that is easily manipulated by computer programs to (among other things) order into subcategories or query for the presence or absence of a finding. The authors review the fundamentals of NLP and describe various techniques that constitute NLP in radiology, along with some key applications.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Biomedical Research / organization & administration*
  • Data Mining / methods*
  • Electronic Health Records / organization & administration*
  • Humans
  • Machine Learning
  • Natural Language Processing*
  • Pattern Recognition, Automated / methods
  • Radiology / organization & administration*
  • Vocabulary, Controlled*