Extraction and mapping of drug names from free text to a standardized nomenclature

Matthew A Levin; Marina Krol; Ankur M Doshi; David L Reich

AMIA Annu Symp Proc. 2007 Oct 11:2007:438-42.

Authors

Matthew A Levin¹, Marina Krol, Ankur M Doshi, David L Reich

Affiliation

¹ Department of Anesthesiology, Mount Sinai School of Medicine, New York, NY, USA.

PMID: 18693874
PMCID: PMC2655777

Abstract

Free text fields are often used to store clinical drug data in electronic health records. The use of free text facilitates rapid data entry by the clinician. Errors in spelling, abbreviations, and jargon, however, limit the utility of these data. We designed and implemented an algorithm, using open source tools and RxNorm, to extract and normalize drug data stored in free text fields of an anesthesia electronic health record. The algorithm was developed using a training set containing drug data from 49,518 cases, and validated using a validation set containing data from 14,655 cases. Overall sensitivity and specificity for the validation set were 92.2% and 95.7%, respectively. The main sources of error were misspellings and unknown but valid drug names. These preliminary results demonstrate that free text clinical drug data can be efficiently extracted and mapped to a controlled drug nomenclature.
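
The abstract does not describe the algorithm's internals, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a tiny hypothetical vocabulary (VOCAB) standing in for RxNorm, a hypothetical abbreviation table (ABBREVIATIONS), and approximate string matching from Python's standard difflib module to absorb misspellings. The helper sensitivity_specificity shows how the two reported metrics are defined from true/false positive and negative counts; the counts in the usage note are illustrative, not the study's data.

```python
import difflib
import re

# Hypothetical mini-vocabulary standing in for RxNorm (assumption, not from the paper).
VOCAB = ["fentanyl", "midazolam", "propofol", "rocuronium", "vecuronium"]
# Hypothetical abbreviation/jargon table (assumption, not from the paper).
ABBREVIATIONS = {"fent": "fentanyl", "roc": "rocuronium"}


def normalize(entry, cutoff=0.8):
    """Map a free-text entry to a vocabulary term, or return None if nothing matches."""
    # Keep alphabetic tokens only, so doses and digits are ignored.
    tokens = re.findall(r"[a-z]+", entry.lower())
    for token in tokens:
        if token in ABBREVIATIONS:
            return ABBREVIATIONS[token]
        if token in VOCAB:
            return token
        # Approximate match to tolerate misspellings such as "fentanly".
        close = difflib.get_close_matches(token, VOCAB, n=1, cutoff=cutoff)
        if close:
            return close[0]
    return None


def sensitivity_specificity(tp, fn, tn, fp):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    return tp / (tp + fn), tn / (tn + fp)
```

For example, normalize("fentanly 50mcg") maps the misspelled entry to "fentanyl", while an entry with no vocabulary counterpart, such as "saline flush", yields None. With illustrative counts, sensitivity_specificity(90, 10, 95, 5) gives a sensitivity of 0.9 and a specificity of 0.95. The cutoff trades off the two error sources named in the abstract: a lower value catches more misspellings but risks mapping unknown drug names to the wrong term.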

Publication types

Validation Study

MeSH terms

Abstracting and Indexing
Algorithms*
Anesthesiology
Humans
Information Storage and Retrieval / methods
Medical Records Systems, Computerized*
Natural Language Processing*
Pharmaceutical Preparations / classification*
Terminology as Topic*

Substances

Pharmaceutical Preparations