Automatic Mapping of Terminology Items with Transformers

AMIA Annu Symp Proc. 2024 Jan 11:2023:599-607. eCollection 2023.

Abstract

Biomedical ontologies are a key component in many systems for the analysis of textual clinical data. They are employed to organize information about a certain domain relying on a hierarchy of different classes. Each class maps a concept to items in a terminology developed by domain experts. These mappings are then leveraged to organize the information extracted by Natural Language Processing (NLP) models to build knowledge graphs for inferences. The creation of these associations, however, requires extensive manual review. In this paper, we present an automated approach and repeatable framework to learn a mapping between ontology classes and terminology terms derived from vocabularies in the Unified Medical Language System (UMLS) metathesaurus. According to our evaluation, the proposed system achieves a performance close to humans and provides a substantial improvement over existing systems developed by the National Library of Medicine to assist researchers through this process.

MeSH terms

  • Biological Ontologies*
  • Humans
  • National Library of Medicine (U.S.)
  • Natural Language Processing
  • Unified Medical Language System*
  • United States