Pretrained Neural Networks Accurately Identify Cancer Recurrence in Medical Record

Stud Health Technol Inform. 2022 May 25:294:93-97. doi: 10.3233/SHTI220403.

Abstract

Cancer recurrence is the diagnosis of a second clinical episode of cancer after the first was considered cured. Identifying patients who have experienced cancer recurrence is an important task because the results can be used to compare treatment effectiveness, measure recurrence-free survival, and plan and prioritize cancer control resources. We developed BERT-based contextual natural language processing (NLP) models that identify the incidence and time of cancer recurrence from progress notes. Using two datasets of breast and colorectal cancer patients, we demonstrated the advantage of the contextual models over traditional NLP models: they avoid the laborious and often unscalable task of composing keywords for a specific disease domain.

Keywords: BERT; Breast cancer; Cancer recurrence; ClinicalBioBert architecture; Colorectal cancer; Natural language processing; Real-world data.

MeSH terms

  • Electronic Health Records
  • Humans
  • Natural Language Processing*
  • Neoplasms* / diagnosis
  • Neural Networks, Computer