Extracting temporal constraints from clinical research eligibility criteria using conditional random fields

AMIA Annu Symp Proc. 2011:2011:843-52. Epub 2011 Oct 22.

Abstract

Temporal constraints are present in 38% of clinical research eligibility criteria and are crucial for screening patients. However, eligibility criteria are often written as free text, which is not amenable for computer processing. In this paper, we present an ontology-based approach to extracting temporal information from clinical research eligibility criteria. We generated temporal labels using a frame-based temporal ontology. We manually annotated 150 free-text eligibility criteria using the temporal labels and trained a parser using Conditional Random Fields (CRFs) to automatically extract temporal expressions from eligibility criteria. An evaluation of an additional 60 randomly selected eligibility criteria using manual review achieved an overall precision of 83%, a recall of 79%, and an F-score of 80%. We illustrate the application of temporal extraction with the use cases of question answering and free-text criteria querying.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Biomedical Research*
  • Clinical Trials as Topic*
  • Databases, Factual
  • Eligibility Determination*
  • Humans
  • Information Storage and Retrieval / methods*
  • Natural Language Processing*
  • Time
  • Vocabulary, Controlled