Learning to Classify Medical Discharge Summaries According to ICD-9

Stud Health Technol Inform. 2023 May 18:302:773-777. doi: 10.3233/SHTI230264.

Abstract

Context: We present a post-hoc approach to improve the recall of ICD classification.

Method: The proposed method can use any classifier as a backbone and aims to calibrate the number of codes returned per document. We test our approach on a new stratified split of the MIMIC-III dataset.

Results: When returning 18 codes on average per document we obtain a recall that is 20% better than a classic classification approach.

Keywords: NLP; Supervised learning; constrained optimization.

MeSH terms

  • Humans
  • International Classification of Diseases*
  • Patient Discharge*