Informatics and data quality at collaborative multicenter Breast and Colon Cancer Family Registries

J Am Med Inform Assoc. 2012 Jun;19(e1):e125-8. doi: 10.1136/amiajnl-2011-000546. Epub 2012 Feb 9.

Abstract

Quality control and harmonization of data is a vital and challenging undertaking for any successful data coordination center and a responsibility shared between the multiple sites that produce, integrate, and utilize the data. Here we describe a coordinated effort between scientists and data managers in the Cancer Family Registries to implement a data governance infrastructure consisting of both organizational and technical solutions. The technical solution uses a rule-based validation system that facilitates error detection and correction for data centers submitting data to a central informatics database. Validation rules comprise both standard checks on allowable values and a crosscheck of related database elements for logical and scientific consistency. Evaluation over a 2-year timeframe showed a significant decrease in the number of errors in the database and a concurrent increase in data consistency and accuracy.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Breast Neoplasms* / epidemiology
  • Colonic Neoplasms* / epidemiology
  • Databases, Factual / standards*
  • Databases, Factual / statistics & numerical data
  • Humans
  • Quality Control
  • Registries / standards*
  • Research Design
  • United States