Prompt Tuning in Biomedical Relation Extraction

J Healthc Inform Res. 2024 Feb 29;8(2):206-224. doi: 10.1007/s41666-024-00162-9. eCollection 2024 Jun.

Abstract

Biomedical relation extraction (RE) is critical for constructing high-quality knowledge graphs and databases and for supporting many downstream text mining applications. This paper explores prompt tuning for biomedical RE, including its few-shot scenarios, aiming to propose a simple yet effective model for this specific task. Prompt tuning reformulates natural language processing (NLP) downstream tasks as masked language problems by embedding specific text prompts into the original input, facilitating the adaptation of pre-trained language models (PLMs) to better address these tasks. This study presents a customized prompt tuning model designed explicitly for biomedical RE, including its applicability in few-shot learning contexts. The model's performance was rigorously assessed on the chemical-protein relation (CHEMPROT) dataset from BioCreative VI and the drug-drug interaction (DDI) dataset from SemEval-2013, where it outperformed conventionally fine-tuned PLMs on both datasets, including in few-shot scenarios. This observation underscores the effectiveness of prompt tuning in enhancing the capabilities of conventional PLMs, though the extent of the enhancement may vary by model. Additionally, the model strikes a balance between simplicity and efficiency, matching state-of-the-art performance without requiring external knowledge or extra computational resources. The pivotal contribution of this study is a suitably designed prompt tuning model that highlights the effectiveness of prompt tuning in biomedical RE. It offers a robust, efficient approach to the field's challenges and represents a significant advancement in extracting complex relations from biomedical texts.
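To illustrate the general idea of recasting RE as a masked language problem, the following is a minimal sketch, not the paper's actual model: the prompt template, the label words ("interaction"/"none"), and the use of a generic BERT checkpoint are illustrative assumptions; the paper's prompt design, verbalizer, and biomedical PLM may differ.

```python
# Sketch: prompt-style masked-LM inference for relation extraction.
# Assumptions: template wording, label words, and checkpoint name are hypothetical.
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

model_name = "bert-base-uncased"  # placeholder; a biomedical PLM could be substituted
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name)

# Original sentence containing the two entities of interest.
sentence = "Aspirin increases the anticoagulant effect of warfarin."

# Prompt template: the RE task is recast as predicting the [MASK] token.
prompt = f"{sentence} The relation between aspirin and warfarin is {tokenizer.mask_token}."

inputs = tokenizer(prompt, return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits

# Locate the [MASK] position and score only the label words defined by a
# (hypothetical) verbalizer that maps label words to relation classes.
mask_pos = (inputs.input_ids == tokenizer.mask_token_id).nonzero(as_tuple=True)[1]
verbalizer = {"interaction": "DDI", "none": "no relation"}
scores = {
    word: logits[0, mask_pos, tokenizer.convert_tokens_to_ids(word)].item()
    for word in verbalizer
}
predicted = max(scores, key=scores.get)
print(f"Predicted label word: {predicted} -> {verbalizer[predicted]}")
```

In prompt tuning, the PLM is then trained so that the correct label word becomes the most probable filler for the mask, rather than training a separate classification head as in conventional fine-tuning.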

Supplementary information: The online version contains supplementary material available at 10.1007/s41666-024-00162-9.

Keywords: Biomedical relation extraction; Few-shot learning; Pre-trained language models; Prompt tuning.