FlauBERT vs. CamemBERT: Understanding patient's answers by a French medical chatbot

Corentin Blanc; Alexandre Bailly; Élie Francis; Thierry Guillotin; Fadi Jamal; Béchara Wakim; Pascal Roy

doi:10.1016/j.artmed.2022.102264

Artif Intell Med. 2022 May:127:102264. doi: 10.1016/j.artmed.2022.102264. Epub 2022 Mar 2.

Authors

Corentin Blanc¹, Alexandre Bailly², Élie Francis³, Thierry Guillotin³, Fadi Jamal⁴, Béchara Wakim⁵, Pascal Roy⁶

Affiliations

¹ Everteam Software, Lyon, France; Université de Lyon, Lyon, France; Université Lyon 1, Villeurbanne, France; Service de Biostatistique-Bioinformatique, Pôle Santé Publique, Hospices Civils de Lyon, Lyon, France; Équipe Biostatistique-Santé, Laboratoire de Biométrie et Biologie Évolutive, CNRS UMR 5558, Villeurbanne, France. Electronic address: c.blanc@everteam.com.
² Everteam Software, Lyon, France; Université de Lyon, Lyon, France; Université Lyon 1, Villeurbanne, France; Service de Biostatistique-Bioinformatique, Pôle Santé Publique, Hospices Civils de Lyon, Lyon, France; Équipe Biostatistique-Santé, Laboratoire de Biométrie et Biologie Évolutive, CNRS UMR 5558, Villeurbanne, France.
³ Everteam Software, Lyon, France.
⁴ IzyCardio, Lyon, France.
⁵ Mediapps Innovation, Lyon, France.
⁶ Université de Lyon, Lyon, France; Université Lyon 1, Villeurbanne, France; Service de Biostatistique-Bioinformatique, Pôle Santé Publique, Hospices Civils de Lyon, Lyon, France; Équipe Biostatistique-Santé, Laboratoire de Biométrie et Biologie Évolutive, CNRS UMR 5558, Villeurbanne, France.

PMID: 35430035
DOI: 10.1016/j.artmed.2022.102264

Abstract

In a number of circumstances, obtaining health-related information from a patient is time-consuming, whereas a chatbot interacting efficiently with that patient might help save health care professionals' time and better assist the patient. Making a chatbot understand patients' answers requires Natural Language Understanding (NLU) technology, which relies on 'intent' and 'slot' predictions. Over the last few years, language models (such as BERT) pre-trained on huge amounts of data have achieved state-of-the-art intent and slot predictions when connected to a neural network architecture (e.g., linear, recurrent, long short-term memory, or bidirectional long short-term memory), with all language model and neural network parameters fine-tuned end-to-end. Currently, two language models are specialized in the French language: FlauBERT and CamemBERT. This study was designed to find out which combination of language model and neural network architecture was best for intent and slot prediction by a chatbot from a French corpus of clinical cases. The comparisons showed that FlauBERT performed better than CamemBERT whatever the network architecture used and that complex architectures did not significantly improve performance vs. simple ones whatever the language model. Thus, in the medical field, the results support recommending FlauBERT with a simple linear network architecture.
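
To make the 'intent' and 'slot' terminology concrete, the following minimal sketch shows what the two kinds of prediction might look like for a single patient answer. It assumes slots are encoded with the common BIO tagging convention; the abstract does not state which scheme the authors used, and the sentence, the intent label `report_symptom`, the slot names `body_part` and `duration`, and the helper `decode_slots` are all hypothetical illustrations rather than material from the study.

```python
from typing import List, Tuple


def decode_slots(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Group BIO-tagged tokens into (slot_name, text) pairs.

    Assumption: one tag per token, using "B-<name>" to open a slot,
    "I-<name>" to continue it, and "O" for tokens outside any slot.
    """
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")

    slots: List[Tuple[str, str]] = []
    current_name = None
    current_tokens: List[str] = []

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new slot starts; close the one that was open, if any.
            if current_name is not None:
                slots.append((current_name, " ".join(current_tokens)))
            current_name, current_tokens = tag[2:], [token]
        elif tag.startswith("I-") and current_name == tag[2:]:
            # Continuation of the currently open slot.
            current_tokens.append(token)
        else:
            # "O" tag or an inconsistent "I-" tag closes any open slot.
            if current_name is not None:
                slots.append((current_name, " ".join(current_tokens)))
            current_name, current_tokens = None, []

    if current_name is not None:
        slots.append((current_name, " ".join(current_tokens)))
    return slots


# Hypothetical model output for one French patient answer
# ("I have had chest pain for two days").
tokens = ["J'ai", "mal", "à", "la", "poitrine", "depuis", "deux", "jours"]
intent = "report_symptom"  # one label for the whole sentence
tags = ["O", "O", "O", "O", "B-body_part", "O", "B-duration", "I-duration"]

slots = decode_slots(tokens, tags)
print(intent, slots)
```

In this framing, intent prediction is a sentence-level classification and slot prediction is a token-level classification; a network placed on top of the language model, such as the simple linear one the abstract mentions, would produce the label and tags that `decode_slots` consumes here.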

Keywords: CamemBERT; FlauBERT; Intent and slot prediction; Language models; Natural Language Understanding; Neural network architectures.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Humans
Intention
Language*
Natural Language Processing*
Neural Networks, Computer
Software