Leveraging Collaborative Filtering to Accelerate Rare Disease Diagnosis

AMIA Annu Symp Proc. 2018 Apr 16:2017:1554-1563. eCollection 2017.

Abstract

In the USA, rare diseases are defined as those affecting fewer than 200,000 patients at any given time. Patients with rare diseases are frequently misdiagnosed or undiagnosed which may due to the lack of knowledge and experience of care providers. We hypothesize that patients' phenotypic information available in electronic medical records (EMR) can be leveraged to accelerate disease diagnosis based on the intuition that providers need to document associated phenotypic information to support the diagnosis decision, especially for rare diseases. In this study, we proposed a collaborative filtering system enriched with natural language processing and semantic techniques to assist rare disease diagnosis based on phenotypic characterization. Specifically, we leveraged four similarity measurements with two neighborhood algorithms on 2010-2015 Mayo Clinic unstructured large patient cohort and evaluated different approaches. Preliminary results demonstrated that the use of collaborative filtering with phenotypic information is able to stratify patients with relatively similar rare diseases.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Algorithms*
  • Cohort Studies
  • Electronic Health Records*
  • Humans
  • Natural Language Processing*
  • Phenotype*
  • Rare Diseases / diagnosis*
  • Rare Diseases / genetics
  • Semantics
  • Symptom Assessment
  • Unified Medical Language System