Phenotype Concept Set Construction from Concept Pair Likelihoods

AMIA Annu Symp Proc. 2021 Jan 25:2020:1080-1089. eCollection 2020.

Abstract

Phenotyping algorithms are essential tools for conducting clinical research on observational data. Manually devel- oped phenotyping algorithms, such as those curated within the eMERGE (electronic Medical Records and Genomics) Network, represent the gold standard but are time consuming to create. In this work, we propose a framework for learning from the structure of eMERGE phenotype concept sets to assist construction of novel phenotype definitions. We use eMERGE phenotypes as a source of reference concept sets and engineer rich features characterizing the con- cept pairs within each set. We treat these pairwise relationships as edges in a concept graph, train models to perform edge prediction, and identify candidate phenotype concept sets as highly connected subgraphs. Candidate concept sets may then be interrogated and composed to construct novel phenotype definitions.

MeSH terms

  • Algorithms*
  • Electronic Health Records*
  • Genomics*
  • Humans
  • Phenotype*
  • Probability