Answering List-Type Questions in Health Domain with Pretrained Large Language Model: A Case for COVID-19 Symptoms

Keyuan Jiang; Mohammed M Mujtaba; Gordon R Bernard

doi:10.3233/SHTI231041

Answering List-Type Questions in Health Domain with Pretrained Large Language Model: A Case for COVID-19 Symptoms

Stud Health Technol Inform. 2024 Jan 25:310:629-633. doi: 10.3233/SHTI231041.

Authors

Keyuan Jiang¹, Mohammed M Mujtaba¹, Gordon R Bernard²

Affiliations

¹ Purdue University Northwest, Hammond, Indiana, USA.
² Vanderbilt University, Nashville, Tennessee, USA.

PMID: 38269885
DOI: 10.3233/SHTI231041

Abstract

List-type questions, which can have a varying number of answers, are more common in the health domain where people seek for health-related information from a passage or passages. An example of this type of question answering task is to find COVID-19 symptoms from a Twitter post. However, due to the lack of annotated instances for supervised learning, automatic identification of COVID-19 symptoms from Twitter posts is challenging. We investigated detection of symptom mentions in Twitter posts using GPT-3, a pre-trained large language model, along with few-shot learning. Our results of 5-shot and 10-shot learning on a corpus of 655 annotated tweets demonstrate that few-shot learning with pre-trained large language model is a promising approach to answering list-type questions with a minimal amount of effort of annotation.

Keywords: COVID-19 symptoms; Few-shot learning; GPT-3; List-type question answering; Pre-trained large language model.

MeSH terms

COVID-19*
Humans
Language