UK surveillance: provision of quality assured information from combined datasets

Prev Vet Med. 2007 Sep 14;81(1-3):117-34. doi: 10.1016/j.prevetmed.2007.04.006. Epub 2007 May 4.

Abstract

Surveillance information is most useful when provided within a risk framework, which is achieved by presenting results against an appropriate denominator. Often the datasets are captured separately and for different purposes, and have inherent errors and biases that can be further confounded by the act of merging. The United Kingdom Rapid Analysis and Detection of Animal-related Risks (RADAR) system contains data from several sources and provides both data extracts for research purposes and reports for wider stakeholders. Considerable effort goes into optimising the data in RADAR during the Extraction, Transformation and Loading (ETL) process. Despite these efforts, the final dataset inevitably contains some data errors and biases, most of which cannot be rectified during subsequent analysis. To allow users to establish the 'fitness for purpose' of data merged from more than one data source, Quality Statements are produced as defined within the overarching surveillance Quality Framework. These documents detail the data errors and biases identified following ETL and report construction, as well as relevant aspects of the datasets from which the data originated. This paper illustrates these issues using RADAR datasets, and describes how they can be minimised.
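
As a rough illustration of presenting results against a denominator while recording what is lost in a merge, the sketch below joins a case-count dataset to a separately captured population dataset. It is not RADAR's method: the function name, the `MergeReport` fields, the region keys and all figures are hypothetical, chosen only to show the idea.

```python
from dataclasses import dataclass


@dataclass
class MergeReport:
    """Result of the merge plus the mismatches a quality note might list."""
    rates: dict                       # key -> cases per 1,000 animals at risk
    cases_without_denominator: list   # keys reported but absent from the population data
    denominators_without_cases: list  # keys in the population data with no case record


def merge_with_quality_check(cases, population):
    """Join numerator and denominator by key and record unmatched keys."""
    rates = {}
    unmatched_cases = []
    for key, n_cases in cases.items():
        at_risk = population.get(key)
        if not at_risk:
            # Missing or zero denominator: no rate can be computed,
            # so the key is reported instead of being silently dropped.
            unmatched_cases.append(key)
            continue
        rates[key] = 1000 * n_cases / at_risk
    no_cases = sorted(k for k in population if k not in cases)
    return MergeReport(rates, sorted(unmatched_cases), no_cases)


# Hypothetical inputs captured separately and for different purposes.
cases = {"region_a": 12, "region_b": 3, "region_x": 5}
population = {"region_a": 4000, "region_b": 1500, "region_c": 2500}

report = merge_with_quality_check(cases, population)
print(report.rates)
print(report.cases_without_denominator)
print(report.denominators_without_cases)
```

Here `region_x` has cases but no denominator, and `region_c` has a denominator but no case record. A missing case record may mean either no disease or no reporting, which is the kind of ambiguity that cannot be resolved in later analysis and so belongs in an accompanying quality note.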

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Data Interpretation, Statistical
  • Databases, Factual / standards*
  • Quality Control*
  • Risk Assessment*
  • Risk Management
  • United Kingdom