Applying Data Warehousing to a Phase III Clinical Trial From the Fondazione Italiana Linfomi Ensures Superior Data Quality and Improved Assessment of Clinical Outcomes

JCO Clin Cancer Inform. 2019 Oct:3:1-15. doi: 10.1200/CCI.19.00049.

Abstract

Purpose: Data collection in clinical trials is becoming complex, with a huge number of variables that need to be recorded, verified, and analyzed to effectively measure clinical outcomes. In this study, we used data warehouse (DW) concepts to achieve this goal. A DW was developed to accommodate data from a large clinical trial, including all the characteristics collected. We present the results related to baseline variables with the following objectives: developing a data quality (DQ) control strategy and improving outcome analysis according to the clinical trial primary end points.

Methods: Data were retrieved from the electronic case reporting forms (eCRFs) of the phase III, multicenter MCL0208 trial (ClinicalTrials.gov identifier: NCT02354313) of the Fondazione Italiana Linfomi for younger patients with untreated mantle cell lymphoma (MCL). The DW was created with a relational database management system. Recommended DQ dimensions were observed to monitor the activity of each site to handle DQ management during patient follow-up. The DQ management was applied to clinically relevant parameters that predicted progression-free survival to assess its impact.

Results: The DW encompassed 16 tables, which included 226 variables for 300 patients and 199,500 items of data. The tool allowed cross-comparison analysis and detected some incongruities in eCRFs, prompting queries to clinical centers. This had an impact on clinical end points, as the DQ control strategy was able to improve the prognostic stratification according to single parameters, such as tumor infiltration by flow cytometry, and even using established prognosticators, such as the MCL International Prognostic Index.

Conclusion: The DW is a powerful tool to organize results from large phase III clinical trials and to effectively improve DQ through the application of effective engineered tools.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Clinical Trials, Phase III as Topic
  • Data Warehousing / methods*
  • Data Warehousing / standards*
  • Disease Progression
  • Female
  • Humans
  • Lymphoma, Mantle-Cell / diagnosis
  • Lymphoma, Mantle-Cell / mortality*
  • Lymphoma, Mantle-Cell / therapy*
  • Male
  • Multicenter Studies as Topic
  • Neoplasm Staging
  • Quality Assurance, Health Care / methods*
  • Randomized Controlled Trials as Topic
  • Survival Rate
  • Treatment Outcome

Associated data

  • ClinicalTrials.gov/NCT02354313