Treatment Effect Estimates From Pilot Trials Are Unreliable

Jesse D Troy; Megan L Neely; Gina-Maria Pomann; Steven C Grambow; Gregory P Samsa

doi:10.1016/j.jpainsymman.2023.08.020

J Pain Symptom Manage. 2023 Dec;66(6):e672-e686. doi: 10.1016/j.jpainsymman.2023.08.020. Epub 2023 Sep 2.

Authors

Jesse D Troy¹, Megan L Neely², Gina-Maria Pomann², Steven C Grambow², Gregory P Samsa²

Affiliations

¹ Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, North Carolina, USA. Electronic address: Jesse.Troy@Duke.edu.
² Department of Biostatistics and Bioinformatics, Duke University School of Medicine, Durham, North Carolina, USA.

PMID: 37666368
DOI: 10.1016/j.jpainsymman.2023.08.020

Abstract

Context: The CONSORT guideline defines a pilot trial as a small-scale version of a desired future efficacy trial that is intended to answer the key questions of whether and how a larger study should be done. For example, a pilot trial might evaluate different approaches to data collection or outcome measurement. However, pilot trials are unreliable for assessing treatment efficacy due to the statistical phenomenon called sampling variability.

Objectives: In this tutorial, we use computer simulation to demonstrate the influence of sampling variability on efficacy estimates from pilot trials, illustrating why pilot trial designs should not be used to evaluate whether a treatment is promising.

Methods: We simulate a 2-arm parallel group trial (N=20 per group) with a survival outcome as an example. Simulations are done under two scenarios: 1) the treatment is efficacious at the level of a hypothetical minimum clinically important difference (hazard ratio [HR] = 0.75); and 2) the treatment is not efficacious (HR=1).

Results: As expected, in both simulated scenarios the observed hazard ratios are distributed around the true treatment effect (HR=0.75 or HR=1). Importantly, ∼20% of trials simulated under scenario 1 incorrectly suggest the treatment may be harmful (HR > 1). Under scenario 2, half of the simulated studies incorrectly suggest the treatment is beneficial (HR < 1).
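
The simulation described in the Methods and Results can be approximated with a minimal sketch. This is not the authors' code, and it rests on assumptions the abstract does not state: survival times are taken to be exponential, there is no censoring, the control-arm hazard is 1.0, and the hazard ratio is estimated as the ratio of exponential maximum-likelihood hazard rates (events divided by total follow-up time) rather than from a Cox proportional hazards model. The function names, seed, and number of simulated trials are hypothetical choices made for illustration.

```python
import random


def simulate_hr(true_hr, n_per_arm=20, base_rate=1.0, rng=random):
    """Simulate one 2-arm trial and return the estimated hazard ratio.

    Assumptions (not from the source): exponential survival times,
    no censoring, control-arm hazard equal to base_rate.
    """
    control = [rng.expovariate(base_rate) for _ in range(n_per_arm)]
    treated = [rng.expovariate(base_rate * true_hr) for _ in range(n_per_arm)]
    # Exponential MLE of the hazard: number of events / total time at risk.
    rate_control = n_per_arm / sum(control)
    rate_treated = n_per_arm / sum(treated)
    return rate_treated / rate_control


def fraction_hr_above_one(true_hr, n_trials=10000, seed=2023):
    """Fraction of simulated pilot trials whose estimated HR exceeds 1."""
    rng = random.Random(seed)  # fixed seed so the run is reproducible
    estimates = [simulate_hr(true_hr, rng=rng) for _ in range(n_trials)]
    return sum(hr > 1 for hr in estimates) / n_trials


if __name__ == "__main__":
    # Scenario 1: truly efficacious treatment (HR = 0.75).
    harmful = fraction_hr_above_one(0.75)
    print(f"HR=0.75: fraction of trials suggesting harm (HR > 1): {harmful:.2f}")

    # Scenario 2: no treatment effect (HR = 1).
    beneficial = 1 - fraction_hr_above_one(1.0)
    print(f"HR=1.00: fraction of trials suggesting benefit (HR < 1): {beneficial:.2f}")
```

Under these simplified assumptions, the two fractions should fall in the neighborhood of the figures reported above, though the exact values depend on the seed, the assumed survival distribution, and the estimator used.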

Conclusion: Treatment effect estimates from pilot trials should not be used to make future development decisions regarding a novel therapy because of the high risk of misleading conclusions.

Keywords: Clinical trials; Pilot studies; Statistics and research methods.

Publication types

Review

MeSH terms

Computer Simulation*
Humans
Pilot Projects
Proportional Hazards Models
Treatment Outcome