Review of the Performance Metrics for Natural Language Systems for Clinical Trials Matching

Jeongeun Kim; Yuri Quintana

doi:10.3233/SHTI220156

Review of the Performance Metrics for Natural Language Systems for Clinical Trials Matching

Stud Health Technol Inform. 2022 Jun 6:290:641-644. doi: 10.3233/SHTI220156.

Authors

Jeongeun Kim^{1

2}, Yuri Quintana^{1

2}

Affiliations

¹ Harvard Medical School, Boston, MA.
² Beth Israel Deaconess Medical Center, Department of Clinical Informatics, Boston, MA.

PMID: 35673095
DOI: 10.3233/SHTI220156

Abstract

Natural Language Processing (NLP) has been adopted widely in clinical trial matching for its ability to process unstructured text that is often found in electronic health records. Despite the rise in the new tools that use NLP to match patients to eligible clinical trials, the comparison of these tools is difficult due to the lack of consistency in how these tools are evaluated. The ground truth or reference that the tools use to assess results varies, making it difficult to compare the robustness of the tools against each other. This paper alarms the lack of definition and consistency of ground truth data used to evaluate such tools and suggests two ways to define a gold standard for the ground truth in small and large-scale studies.

Keywords: Clinical Trial Matching; Eligibility Criteria; Natural Language Processing.

Publication types

Review

MeSH terms

Benchmarking*
Electronic Health Records
Humans
Language
Natural Language Processing*