A versatile test for equality of two survival functions based on weighted differences of Kaplan-Meier curves

Stat Med. 2015 Dec 10;34(28):3680-95. doi: 10.1002/sim.6591. Epub 2015 Jul 20.

Abstract

With censored event time observations, the logrank test is the most popular tool for testing the equality of two underlying survival distributions. Although this test is asymptotically distribution free, it may not be powerful when the proportional hazards assumption is violated. Various other novel testing procedures have been proposed, which generally are derived by assuming a class of specific alternative hypotheses with respect to the hazard functions. The test considered by Pepe and Fleming (1989) is based on a linear combination of weighted differences of the two Kaplan-Meier curves over time and is a natural tool to assess the difference of two survival functions directly. In this article, we take a similar approach but choose weights that are proportional to the observed standardized difference of the estimated survival curves at each time point. The new proposal automatically makes weighting adjustments empirically. The new test statistic is aimed at a one-sided general alternative hypothesis and is distributed with a short right tail under the null hypothesis but with a heavy tail under the alternative. The results from extensive numerical studies demonstrate that the new procedure performs well under various general alternatives with a caution of a minor inflation of the type I error rate when the sample size is small or the number of observed events is small. The survival data from a recent cancer comparative study are utilized for illustrating the implementation of the process.

Keywords: logrank test; perturbation resampling method; proportional hazards; robust tests.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computer Simulation
  • Humans
  • Kaplan-Meier Estimate*
  • Models, Statistical
  • Observational Studies as Topic / statistics & numerical data
  • Proportional Hazards Models
  • Sample Size*
  • Time Factors