Natural language processing using online analytic processing for assessing recommendations in radiology reports

J Am Coll Radiol. 2008 Mar;5(3):197-204. doi: 10.1016/j.jacr.2007.09.003.

Abstract

Purpose: The study purpose was to describe the use of natural language processing (NLP) and online analytic processing (OLAP) for assessing patterns in recommendations in unstructured radiology reports on the basis of patient and imaging characteristics, such as age, gender, referring physicians, radiology subspecialty, modality, indications, diseases, and patient status (inpatient vs outpatient).

Materials and methods: A database of 4,279,179 radiology reports from a single tertiary health care center during a 10-year period (1995-2004) was created. The database includes reports of computed tomography, magnetic resonance imaging, fluoroscopy, nuclear medicine, ultrasound, radiography, mammography, angiography, special procedures, and unclassified imaging tests with patient demographics. A clinical data mining and analysis NLP program (Leximer, Nuance Inc, Burlington, Massachusetts) in conjunction with OLAP was used for classifying reports into those with recommendations (I(REC)) and without recommendations (N(REC)) for imaging and determining I(REC) rates for different patient age groups, gender, imaging modalities, indications, diseases, subspecialties, and referring physicians. In addition, temporal trends for I(REC) were also determined.

Results: There was a significant difference in the I(REC) rates in different age groups, varying between 4.8% (10-19 years) and 9.5% (>70 years) (P <.0001). Significant variations in I(REC) rates were observed for different imaging modalities, with the highest rates for computed tomography (17.3%, 100,493/581,032). The I(REC) rates varied significantly for different subspecialties and among radiologists within a subspecialty (P < .0001). For most modalities, outpatients had a higher rate of recommendations when compared with inpatients.

Conclusion: The radiology reports database analyzed with NLP in conjunction with OLAP revealed considerable differences between recommendation trends for different imaging modalities and other patient and imaging characteristics.

MeSH terms

  • Adolescent
  • Adult
  • Age Factors
  • Aged
  • Angiography / methods
  • Child
  • Child, Preschool
  • Cross-Sectional Studies
  • Decision Making, Computer-Assisted*
  • Diagnostic Imaging / methods*
  • Diagnostic Imaging / standards
  • Female
  • Health Planning Guidelines*
  • Humans
  • Infant
  • Magnetic Resonance Imaging / methods
  • Male
  • Middle Aged
  • Natural Language Processing*
  • Quality Control
  • Radiology / standards
  • Radiology Department, Hospital
  • Registries
  • Retrospective Studies
  • Risk Factors
  • Sensitivity and Specificity
  • Sex Factors
  • Tomography, X-Ray Computed / methods
  • Ultrasonography, Doppler / methods
  • United States