Automated ICF Coding of Rehabilitation Notes for Low-Resource Languages via Continual Training of Language Models

Stud Health Technol Inform. 2023 May 18:302:763-767. doi: 10.3233/SHTI230262.

Abstract

The coding of medical documents and in particular of rehabilitation notes using the International Classification of Functioning, Disability and Health (ICF) is a difficult task showing low agreement among experts. Such difficulty is mainly caused by the specific terminology that needs to be used for the task. In this paper, we address the task developing a model based on a large language model, BERT. By leveraging continual training of such a model using ICF textual descriptions, we are able to effectively encode rehabilitation notes expressed in Italian, an under-resourced language.

Keywords: Continual Training; ICF; Language Models; Rehabilitation.

MeSH terms

  • Activities of Daily Living
  • Disability Evaluation
  • Disabled Persons* / rehabilitation
  • Humans
  • International Classification of Functioning, Disability and Health
  • Italy
  • Longitudinal Studies