Towards a scientific workflow methodology for primary care database studies

Stat Methods Med Res. 2010 Aug;19(4):378-93. doi: 10.1177/0962280209359880. Epub 2010 May 4.

Abstract

We describe the challenges of conducting studies based on mining large-scale primary care databases, namely data integration, data set definition, result reproducibility and reusability. These correspond to higher-level informatics challenges of automation, provenance capture and component integration. We provide a high-level view of the informatics infrastructure that addresses these challenges through a generic workflow-based e-Science middleware, and describe our experiences using the system to investigate differences in the health status of patients with diabetes before and after the national introduction of the UK GP contract in 2004.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining / standards
  • Data Mining / statistics & numerical data
  • Databases, Factual / standards
  • Databases, Factual / statistics & numerical data*
  • Diabetes Mellitus, Type 1 / drug therapy*
  • Diabetes Mellitus, Type 2 / drug therapy*
  • Female
  • Humans
  • Male
  • Medical Informatics / methods
  • Medical Informatics / standards
  • Medical Informatics / statistics & numerical data
  • Primary Health Care / standards
  • Primary Health Care / statistics & numerical data*
  • United Kingdom
  • Workflow*