Normalizing Spontaneous Reports Into MedDRA: Some Experiments With MagiCoder

IEEE J Biomed Health Inform. 2019 Jan;23(1):95-102. doi: 10.1109/JBHI.2018.2861213. Epub 2018 Jul 30.

Abstract

Text normalization into medical dictionaries is useful to support clinical tasks. A typical setting is pharmacovigilance (PV). The manual detection of suspected adverse drug reactions (ADRs) in narrative reports is time consuming and natural language processing (NLP) provides a concrete help to PV experts. In this paper, we carry out experiments for testing performances of MagiCoder, an NLP application designed to extract MedDRA terms from narrative clinical text. Given a narrative description, MagiCoder proposes an automatic encoding. The pharmacologist reviews, (possibly) corrects, and then, validates the solution. This drastically reduces the time needed for the validation of reports with respect to a completely manual encoding. In previous work, we mainly tested MagiCoder performances on Italian written spontaneous reports. In this paper, we include some new features, change the experiment design, and carry on more tests about MagiCoder. Moreover, we do a change of language, moving to English documents. In particular, we tested MagiCoder on the CADEC dataset, a corpus of manually annotated posts about ADRs collected from the social media.

MeSH terms

  • Data Mining / methods*
  • Drug-Related Side Effects and Adverse Reactions*
  • Humans
  • Medical Informatics / methods*
  • Natural Language Processing*
  • Pharmacovigilance*