Accuracy of a probabilistic record-linkage methodology used to track blood donors in the Mortality Information System database

Cad Saude Publica. 2014 Aug;30(8):1623-32. doi: 10.1590/0102-311x00024914.

Abstract

The probabilistic record linkage (PRL) is based on a likelihood score that measures the degree of similarity of several matching variables. Screening test results for different diseases are available for the blood donor population. In this paper, we describe the accuracy of a PRL process used to track blood donors from the Fundação Pró-Sangue (FPS) in the Mortality Information System (SIM), in order that future studies might determine the blood donor's cause of death. The databases used for linkage were SIM and the database made up of individuals that were living (200 blood donors in 2007) and dead (196 from the Hospital das Clinicas de São Paulo that died in 2001-2005). The method consists of cleaning and linking the databases using three blocking steps comparing the variables "Name/Mother's Name/ Date of Birth" to determine a cut-off score. For a cut-off score of 7.06, the sensitivity and specificity of the method is 94.4% (95%CI: 90.0-97.0) and 100% (95%CI: 98.0-100.0), respectively. This method can be used in studies that aim to track blood donors from the FPS database in SIM.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Blood Donors / statistics & numerical data*
  • Brazil
  • Cause of Death
  • Humans
  • Information Systems / statistics & numerical data*
  • Medical Record Linkage / methods*
  • Mortality*
  • Predictive Value of Tests
  • Probability
  • Reproducibility of Results
  • Sensitivity and Specificity