Towards Phenotyping of Clinical Trial Eligibility Criteria

Matthias Löbe; Sebastian Stäubert; Colleen Goldberg; Ivonne Haffner; Alfred Winter

Stud Health Technol Inform. 2018:248:293-299.

Authors

Matthias Löbe¹, Sebastian Stäubert¹, Colleen Goldberg¹, Ivonne Haffner², Alfred Winter¹

Affiliations

¹ Institute for Medical Informatics, Statistics and Epidemiology (IMISE), Universität Leipzig, Germany.
² University Cancer Center Leipzig, Germany.

PMID: 29726450

Abstract

Background: Medical plaintext documents contain important facts about patients, but these facts are rarely available for structured queries. Providing structured information extracted from natural language texts, in addition to existing structured data, can significantly speed up the search for patients who fulfill inclusion criteria and thus improve the recruitment rate.

Objectives: This work aims to support clinical trial recruitment by using text mining techniques to identify suitable subjects in hospitals.

Method: Based on the inclusion/exclusion criteria of 5 sample studies and a text corpus of 212 doctors' letters and medical follow-up documents from a university cancer center, a prototype that uses NLP procedures (UIMA) to extract facts from medical free texts was developed and technically evaluated.
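
To make the idea of extracting criterion-relevant facts from free text concrete, the following is a minimal sketch in Python. It is not the authors' UIMA-based prototype: it swaps in plain regular expressions for the NLP pipeline, and the criterion names, patterns, and the function extract_facts are hypothetical illustrations, not taken from the five sample studies.

```python
import re
from dataclasses import dataclass


@dataclass
class Criterion:
    """A named eligibility criterion with a text pattern (hypothetical)."""
    name: str
    pattern: re.Pattern


# Hypothetical criteria for illustration only; real criteria would be
# derived from the inclusion/exclusion criteria of a concrete study.
CRITERIA = [
    Criterion(
        "her2_positive",
        re.compile(r"\bHER2[\s-]*(positive|pos\.?|\+)", re.IGNORECASE),
    ),
    Criterion(
        "ecog_0_1",
        re.compile(r"\bECOG\s*(?:status\s*)?[:=]?\s*[01]\b", re.IGNORECASE),
    ),
]


def extract_facts(text: str) -> set[str]:
    """Return the names of all criteria whose pattern occurs in the text."""
    return {c.name for c in CRITERIA if c.pattern.search(text)}


if __name__ == "__main__":
    letter = "Patient with HER2-positive gastric carcinoma, ECOG 1."
    print(sorted(extract_facts(letter)))
```

A pattern match of this kind only signals that a document mentions a criterion. It does not handle negation, abbreviations, or context, which is part of what a full NLP pipeline is meant to address.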

Results: Although the extracted entities are not always correct (precision between 23% and 96%), they provide a decisive indication of which patient files should be read first.
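
The precision figure and the idea of reading some patient files first can be illustrated with a short Python sketch. Precision is computed in the usual way as the share of extracted entities that are correct, TP / (TP + FP). The ranking rule (most matched criteria first) and the names precision and rank_files are assumptions made for illustration; the abstract does not state how the prototype orders files.

```python
from typing import Optional


def precision(extracted: set[str], gold: set[str]) -> Optional[float]:
    """Fraction of extracted entities that are correct: TP / (TP + FP).

    Returns None when nothing was extracted, since precision is
    undefined in that case.
    """
    if not extracted:
        return None
    true_positives = len(extracted & gold)
    return true_positives / len(extracted)


def rank_files(matches_per_file: dict[str, set[str]]) -> list[str]:
    """Order file ids by number of matched criteria, most matches first.

    Ties are broken alphabetically by file id (an assumed convention).
    """
    return sorted(
        matches_per_file,
        key=lambda file_id: (-len(matches_per_file[file_id]), file_id),
    )


if __name__ == "__main__":
    # Toy values, not results from the study.
    print(precision({"a", "b", "c", "d"}, {"a", "x"}))
    print(rank_files({"p1": {"x"}, "p2": {"x", "y"}, "p3": set()}))
```

Even with low precision for a given criterion, a ranking of this kind can reduce the number of files a study nurse or physician has to open before finding a candidate.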

Conclusion: The prototype presented here demonstrates technical feasibility. An in-depth evaluation is required to identify phenotypes that are both available and worthwhile.

Keywords: Apache UIMA; Clinical Trials; NLP; Phenotyping; Recruitment; Text Mining; cTAKES.

MeSH terms

Clinical Trials as Topic*
Data Mining*
Eligibility Determination*
Humans
Language
Natural Language Processing*
Research Design