PEDL+: protein-centered relation extraction from PubMed at your fingertip

Leon Weber; Fabio Barth; Leonie Lorenz; Fabian Konrath; Kirsten Huska; Jana Wolf; Ulf Leser

doi:10.1093/bioinformatics/btad603

PEDL+: protein-centered relation extraction from PubMed at your fingertip

Bioinformatics. 2023 Nov 1;39(11):btad603. doi: 10.1093/bioinformatics/btad603.

Authors

Leon Weber¹, Fabio Barth², Leonie Lorenz³, Fabian Konrath⁴, Kirsten Huska⁴, Jana Wolf^{4

5}, Ulf Leser²

Affiliations

¹ Center for Information and Language Processing, Ludwig-Maximilians-Universität München, Geschwister-Scholl-Platz 1, München 80539, Germany.
² Computer Science Department, Humboldt-Universität zu Berlin, Unter den Linden 6, Berlin 10099, Germany.
³ Pathogen Informatics and Modelling, EMBL-EBI, Hinxton, Cambridgeshire CB10 1SD, United Kingdom.
⁴ Mathematical Modelling of Cellular Processes, Max Delbrück Center for Molecular Medicine, Robert-Rössle-Str. 10, Berlin 13125, Germany.
⁵ Department of Mathematics and Computer Science, Free University Berlin, Berlin, 14195, Germany.

Abstract

Summary: Relation extraction (RE) from large text collections is an important tool for database curation, pathway reconstruction, or functional omics data analysis. In practice, RE often is part of a complex data analysis pipeline requiring specific adaptations like restricting the types of relations or the set of proteins to be considered. However, current systems are either non-programmable web sites or research code with fixed functionality. We present PEDL+, a user-friendly tool for extracting protein-protein and protein-chemical associations from PubMed articles. PEDL+ combines state-of-the-art NLP technology with adaptable ranking and filtering options and can easily be integrated into analysis pipelines. We evaluated PEDL+ in two pathway curation projects and found that 59% to 80% of its extractions were helpful.

Availability and implementation: PEDL+ is freely available at https://github.com/leonweber/pedl.

PEDL+: protein-centered relation extraction from PubMed at your fingertip

Authors

Affiliations

Abstract

Publication types

MeSH terms

Grants and funding