Learning to Classify Medical Discharge Summaries According to ICD-9

Leonardo Moros; Jérôme Azé; Sandra Bringay; Pascal Poncelet; Maximilien Servajean; Caroline Dunoyer

doi:10.3233/SHTI230264

Learning to Classify Medical Discharge Summaries According to ICD-9

Stud Health Technol Inform. 2023 May 18:302:773-777. doi: 10.3233/SHTI230264.

Authors

Leonardo Moros¹, Jérôme Azé¹, Sandra Bringay^{1

2}, Pascal Poncelet¹, Maximilien Servajean^{1

2}, Caroline Dunoyer^{3

4}

Affiliations

¹ LIRMM UMR 5506, University of Montpellier, CNRS, Montpellier, France.
² AMIS, Paul-Valéry University, Montpellier, France.
³ Medical Information Department, CHU Montpellier, Montpellier, France.
⁴ IDESP, UMR UA11, INSERM - University of Montpellier, Montpellier, France.

PMID: 37203493
DOI: 10.3233/SHTI230264

Abstract

Context: We present a post-hoc approach to improve the recall of ICD classification.

Method: The proposed method can use any classifier as a backbone and aims to calibrate the number of codes returned per document. We test our approach on a new stratified split of the MIMIC-III dataset.

Results: When returning 18 codes on average per document we obtain a recall that is 20% better than a classic classification approach.

Keywords: NLP; Supervised learning; constrained optimization.

MeSH terms

Humans
International Classification of Diseases*
Patient Discharge*