Toward Electronic Surveillance of Invasive Mold Diseases in Hematology-Oncology Patients: An Expert System Combining Natural Language Processing of Chest Computed Tomography Reports, Microbiology, and Antifungal Drug Data

JCO Clin Cancer Inform. 2017 Nov:1:1-10. doi: 10.1200/CCI.17.00011.

Abstract

Purpose: Prospective epidemiologic surveillance of invasive mold disease (IMD) in hematology patients is hampered by the absence of a reliable laboratory prompt. This study develops an expert system for electronic surveillance of IMD that combines probabilities derived from natural language processing (NLP) of computed tomography (CT) reports with microbiology and antifungal drug data to improve prediction of IMD.

Methods: Microbiology indicators and antifungal drug-dispensing data were extracted from hospital information systems at three tertiary hospitals for 123 hematology-oncology patients. Of this group, 64 case patients had 26 probable/proven IMD according to international definitions, and 59 patients were uninfected controls. Probabilities derived from NLP, combined with medical expertise, identified patients at high likelihood of IMD; the remaining patients were processed by a machine-learning classifier trained on all available features.
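
The two-stage design described in Methods can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the abstract does not state the actual rules or thresholds, so the `PatientFeatures` fields, the `expert_rule` logic, and the 0.9 cutoff are all hypothetical, and `fallback` stands in for whatever trained classifier handles the remaining patients.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class PatientFeatures:
    # All field names are hypothetical stand-ins for the three data sources.
    nlp_probability: float       # probability of IMD derived from NLP of CT reports
    positive_microbiology: bool  # any supportive microbiology indicator
    antifungal_dispensed: bool   # antifungal drug dispensed to the patient


def expert_rule(p: PatientFeatures, high_threshold: float = 0.9) -> bool:
    """Hypothetical expert rule: high NLP probability plus one supporting signal."""
    supported = p.positive_microbiology or p.antifungal_dispensed
    return p.nlp_probability >= high_threshold and supported


def classify(
    p: PatientFeatures,
    fallback: Callable[[PatientFeatures], bool],
    high_threshold: float = 0.9,
) -> bool:
    """Stage 1: flag high-likelihood patients by rule. Stage 2: defer to a classifier."""
    if expert_rule(p, high_threshold):
        return True
    return fallback(p)


# Usage with a trivial stand-in for the trained classifier (assumption).
def always_negative(_: PatientFeatures) -> bool:
    return False


flagged = classify(PatientFeatures(0.95, True, False), always_negative)
deferred = classify(PatientFeatures(0.40, False, True), always_negative)
```

In this sketch the rule stage only ever flags patients; every patient it does not flag is passed to the classifier, which mirrors the order of operations given in Methods.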

Results: Compared with the baseline text classifier, the expert system that incorporated the best-performing algorithm (naïve Bayes) improved specificity from 50.8% (95% CI, 37.5% to 64.1%) to 74.6% (95% CI, 61.6% to 85.0%), reducing false positives by 48%, from 29 to 15; improved sensitivity slightly from 96.9% (95% CI, 89.2% to 99.6%) to 98.4% (95% CI, 91.6% to 100%); and improved receiver operating characteristic area from 73.9% (95% CI, 67.1% to 80.6%) to 92.8% (95% CI, 88% to 97.5%).
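
The point estimates in Results can be checked against the cohort sizes given in Methods (64 case patients, 59 controls). The short calculation below is a sketch of that arithmetic. The false-positive counts (29 and 15) are stated in the abstract; the true-positive counts (62 and 63) are not stated and are inferred here from the reported sensitivities, so treat them as an assumption.

```python
def rate(numerator: int, denominator: int) -> float:
    """Return a proportion as a percentage."""
    return 100.0 * numerator / denominator


CASES, CONTROLS = 64, 59          # from Methods
fp_before, fp_after = 29, 15      # false positives, stated in Results
tp_before, tp_after = 62, 63      # inferred from 96.9% and 98.4% (assumption)

# Specificity = true negatives / all controls
spec_before = rate(CONTROLS - fp_before, CONTROLS)
spec_after = rate(CONTROLS - fp_after, CONTROLS)

# Sensitivity = true positives / all cases
sens_before = rate(tp_before, CASES)
sens_after = rate(tp_after, CASES)

# Relative reduction in false positives
fp_reduction = rate(fp_before - fp_after, fp_before)

print(f"specificity: {spec_before:.1f}% -> {spec_after:.1f}%")
print(f"sensitivity: {sens_before:.1f}% -> {sens_after:.1f}%")
print(f"false-positive reduction: {fp_reduction:.1f}%")
```

Under these counts the computed values agree with the reported 50.8% and 74.6% specificity, 96.9% and 98.4% sensitivity, and a 48% reduction in false positives.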

Conclusion: An expert system that uses multiple sources of data (CT reports, microbiology, antifungal drug dispensing) is a promising approach to continuous prospective surveillance of IMD in the hospital, and demonstrates reduced false notifications (positives) compared with NLP of CT reports alone. Our expert system could provide decision support for IMD surveillance, which is critical to antifungal stewardship and improving supportive care in cancer.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Algorithms
  • Antifungal Agents / therapeutic use
  • Case-Control Studies
  • Combined Modality Therapy
  • Electronic Health Records
  • Expert Systems
  • Female
  • Humans
  • Invasive Fungal Infections / diagnosis*
  • Invasive Fungal Infections / etiology
  • Invasive Fungal Infections / therapy*
  • Machine Learning
  • Male
  • Medical Oncology* / methods
  • Microbiological Techniques
  • Middle Aged
  • Monitoring, Physiologic / methods*
  • Natural Language Processing
  • Neoplasms / complications
  • Neoplasms / diagnosis*
  • Neoplasms / therapy*
  • ROC Curve
  • Sensitivity and Specificity
  • Telemedicine / methods*
  • Tomography, X-Ray Computed
  • Young Adult

Substances

  • Antifungal Agents