genRCT: a statistical analysis framework for generalizing RCT findings to real-world population

Dasom Lee; Shu Yang; Mark Berry; Tom Stinchcombe; Harvey Jay Cohen; Xiaofei Wang

doi:10.1080/10543406.2024.2333136

genRCT: a statistical analysis framework for generalizing RCT findings to real-world population

J Biopharm Stat. 2024 Apr 8:1-20. doi: 10.1080/10543406.2024.2333136. Online ahead of print.

Authors

Dasom Lee¹, Shu Yang¹, Mark Berry², Tom Stinchcombe³, Harvey Jay Cohen³, Xiaofei Wang⁴

Affiliations

¹ Department of Statistics, North Carolina State University, Elk Grove, USA.
² Department of Cardiothoracic Surgery, Stanford University, Stanford, USA.
³ Department of Medicine, Duke University, Durham, USA.
⁴ Department of Biostatistics & Bioinformatics, Duke University, Durham, USA.

PMID: 38590156
DOI: 10.1080/10543406.2024.2333136

Abstract

When evaluating the real-world treatment effect, the analysis based on randomized clinical trials (RCTs) often introduces generalizability bias due to the difference in risk factors between the trial participants and the real-world patient population. This problem of lack of generalizability associated with the RCT-only analysis can be addressed by leveraging observational studies with large sample sizes that are representative of the real-world population. A set of novel statistical methods, termed "genRCT", for improving the generalizability of the trial has been developed using calibration weighting, which enforces the covariates balance between the RCT and observational study. This paper aims to review statistical methods for generalizing the RCT findings by harnessing information from large observational studies that represent real-world patients. Specifically, we discuss the choices of data sources and variables to meet key theoretical assumptions and principles. We introduce and compare estimation methods for continuous, binary, and survival endpoints. We showcase the use of the $R$ package $genRCT$ through a case study that estimates the average treatment effect of adjuvant chemotherapy for the stage 1B non-small cell lung patients represented by a large cancer registry.

Keywords: Causal inference; Randomized clinical trials; covariates balance; genRCT; generalizability; real-world data; treatment effect heterogeneity.