Clinical Information Extraction at the CLEF eHealth Evaluation Lab 2016

CEUR Workshop Proc. 2016 Sep;1609:28-42.

Abstract

This paper reports on Task 2 of the 2016 CLEF eHealth evaluation lab, which extended the previous information extraction tasks of the ShARe/CLEF eHealth evaluation labs. The task continued with named entity recognition and normalization in French narratives, as offered in CLEF eHealth 2015. Named entity recognition involved ten types of entities, including disorders, defined according to Semantic Groups in the Unified Medical Language System® (UMLS®), which was also used for normalizing the entities. In addition, we introduced a large-scale classification task on French death certificates, which consisted of extracting causes of death as coded in the International Classification of Diseases, tenth revision (ICD10). Participant systems were evaluated against a blind reference standard of 832 titles of scientific articles indexed in MEDLINE, 4 drug monographs published by the European Medicines Agency (EMEA), and 27,850 death certificates using Precision, Recall, and F-measure. In total, seven teams participated, including five in the entity recognition and normalization task and five in the death certificate coding task. Three teams submitted their systems to our newly offered reproducibility track. For entity recognition, the highest performance was achieved on the EMEA corpus, with an overall F-measure of 0.702 for plain entity recognition and 0.529 for normalized entity recognition. For entity normalization, the highest performance was achieved on the MEDLINE corpus, with an overall F-measure of 0.552. For death certificate coding, the highest performance was an F-measure of 0.848.
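For reference, and assuming the standard definitions of the metrics named above (the abstract itself does not spell them out), Precision, Recall, and the balanced F-measure over true positives (TP), false positives (FP), and false negatives (FN) are:

\[
P = \frac{TP}{TP + FP}, \qquad
R = \frac{TP}{TP + FN}, \qquad
F = \frac{2\,P\,R}{P + R}
\]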

Keywords: Biomedical Text; Entity Linking; French; Named Entity Recognition; Natural Language Processing; Text Classification; UMLS.