A versatile test for equality of two survival functions based on weighted differences of Kaplan-Meier curves

Hajime Uno; Lu Tian; Brian Claggett; L J Wei

doi:10.1002/sim.6591

A versatile test for equality of two survival functions based on weighted differences of Kaplan-Meier curves

Stat Med. 2015 Dec 10;34(28):3680-95. doi: 10.1002/sim.6591. Epub 2015 Jul 20.

Authors

Hajime Uno¹, Lu Tian², Brian Claggett³, L J Wei⁴

Affiliations

¹ Department of Biostatistics and Computational Biology and Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, U.S.A.
² Department of Health Research and Policy, Stanford University School of Medicine, Stanford, 94305, CA, U.S.A.
³ Brigham and Women's Hospital, Division of Cardiovascular Medicine, Harvard Medical School, Boston, 02115, MA, U.S.A.
⁴ Department of Biostatistics, Harvard University, Boston, 02115, MA, U.S.A.

Abstract

With censored event time observations, the logrank test is the most popular tool for testing the equality of two underlying survival distributions. Although this test is asymptotically distribution free, it may not be powerful when the proportional hazards assumption is violated. Various other novel testing procedures have been proposed, which generally are derived by assuming a class of specific alternative hypotheses with respect to the hazard functions. The test considered by Pepe and Fleming (1989) is based on a linear combination of weighted differences of the two Kaplan-Meier curves over time and is a natural tool to assess the difference of two survival functions directly. In this article, we take a similar approach but choose weights that are proportional to the observed standardized difference of the estimated survival curves at each time point. The new proposal automatically makes weighting adjustments empirically. The new test statistic is aimed at a one-sided general alternative hypothesis and is distributed with a short right tail under the null hypothesis but with a heavy tail under the alternative. The results from extensive numerical studies demonstrate that the new procedure performs well under various general alternatives with a caution of a minor inflation of the type I error rate when the sample size is small or the number of observed events is small. The survival data from a recent cancer comparative study are utilized for illustrating the implementation of the process.

Keywords: logrank test; perturbation resampling method; proportional hazards; robust tests.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Computer Simulation
Humans
Kaplan-Meier Estimate*
Models, Statistical
Observational Studies as Topic / statistics & numerical data
Proportional Hazards Models
Sample Size*
Time Factors

Abstract

Publication types

MeSH terms

Grants and funding