Is it possible to estimate the incidence of breast cancer from medico-administrative databases?

Eur J Epidemiol. 2008;23(10):681-8. doi: 10.1007/s10654-008-9282-y. Epub 2008 Aug 21.

Abstract

One approach to estimate cancer incidence in the French Départements is to quantify the relationship between data in cancer registries and data obtained from the PMSI (Programme de Médicalisation des Systèmes d'Information Médicale). This relationship may then be used in Départements without registries to infer the incidence from local PMSI data. We present here some methodological solutions to apply this approach. Data on invasive breast cancer for 2002 were obtained from 12 Départemental registries. The number of hospital stays was obtained from the National PMSI using two different algorithms based on the main diagnosis only (Algorithm 1) or on that diagnosis associated to a mention of "resection" (Algorithm 2). Considering registry data as gold standard, a calibration approach was used to model the ratio of the number of hospital stays to the number of incident cases. In Départements with registries, validation of the predictions was done through cross-validation. In Départements without registries, validation was done through a study of homogeneity of the mean number of hospital stays per patient. Cross-validation showed that the estimates predicted by the model were true with data extracted by Algorithm 1 but not by Algorithm 2. However, with Algorithm 1, there was an important heterogeneity between French Départements as to the mean number of hospital stays per patient, which had an important impact on the estimations. In the near future, the method will allow using medico-administrative data (after calibration with registry data) to estimate Départemental incidence of selected cancers.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Breast Neoplasms / epidemiology*
  • Databases, Factual*
  • Epidemiologic Studies
  • Female
  • France
  • Humans
  • Medical Records / statistics & numerical data*
  • Middle Aged
  • Models, Statistical
  • Registries
  • Young Adult