Cross-Language Terminology Mapping Between ICD-10-CN and SNOMED-CT

Stud Health Technol Inform. 2022 Jun 6:290:42-46. doi: 10.3233/SHTI220028.

Abstract

The objective of this study was to develop a hybrid method and perform an initial evaluation of mappings from the International Statistical Classification of Diseases, 10th revision, Chinese version (ICD-10-CN) to the Systematized Nomenclature of Medicine - Clinical Terms (SNOMED-CT). The methods used to perform mapping include reusing existing mappings, term similarity modeling for automatic mapping and manual review. We evaluated the results of automatic mapping and the coverage of the maps between two terminologies. Experimental results demonstrated that fine-tuning the pre-trained biomedical language model of PubmedBERT obtained the optimal performance, with a precision of 0.859, a recall of 0.773, and a F1 of 0.814. 100% 4-digit code ICD-10-CN terms were mapped to SNOMED-CT terms through exsit code mappings. Around 42.41% randomly selected 6-digit code ICD-10-CN terms had exact matches to corresponding SNOMED-CT terms, and we did not find appropriate SNOMED-CT terms for ICD grouping terms.

Keywords: Controlled; Health Information Interoperability; Terminology; Vocabulary.

MeSH terms

  • International Classification of Diseases*
  • Language
  • Systematized Nomenclature of Medicine*