Data management and data analysis techniques in pharmacoepidemiological studies using a pre-planned multi-database approach: a systematic literature review

Pharmacoepidemiol Drug Saf. 2015 Sep;24(9):897-905. doi: 10.1002/pds.3828. Epub 2015 Jul 14.

Abstract

Purpose: To identify pharmacoepidemiological multi-database studies and to describe data management and data analysis techniques used for combining data.

Methods: Systematic literature searches were conducted in PubMed and Embase complemented by a manual literature search. We included pharmacoepidemiological multi-database studies published from 2007 onwards that combined data for a pre-planned common analysis or quantitative synthesis. Information was retrieved about study characteristics, methods used for individual-level analyses and meta-analyses, data management and motivations for performing the study.

Results: We found 3083 articles by the systematic searches and an additional 176 by the manual search. After full-text screening of 75 articles, 22 were selected for final inclusion. The number of databases used per study ranged from 2 to 17 (median = 4.0). Most studies used a cohort design (82%) instead of a case-control design (18%). Logistic regression was most often used for individual-level analyses (41%), followed by Cox regression (23%) and Poisson regression (14%). As meta-analysis method, a majority of the studies combined individual patient data (73%). Six studies performed an aggregate meta-analysis (27%), while a semi-aggregate approach was applied in three studies (14%). Information on central programming or heterogeneity assessment was missing in approximately half of the publications. Most studies were motivated by improving power (86%).

Conclusions: Pharmacoepidemiological multi-database studies are a well-powered strategy to address safety issues and have increased in popularity. To be able to correctly interpret the results of these studies, it is important to systematically report on database management and analysis techniques, including central programming and heterogeneity testing.

Keywords: analysis techniques; data management; multi-database; pharmacoepidemiology; systematic review.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review
  • Systematic Review

MeSH terms

  • Case-Control Studies
  • Cohort Studies
  • Databases, Factual* / trends
  • Humans
  • Pharmacoepidemiology / methods*
  • Pharmacoepidemiology / trends
  • Statistics as Topic / methods*
  • Statistics as Topic / trends