Survival forests for data with dependent censoring

Stat Methods Med Res. 2019 Feb;28(2):445-461. doi: 10.1177/0962280217727314. Epub 2017 Aug 24.

Abstract

Tree-based methods are very powerful and popular tools for analysing survival data with right-censoring. The existing methods assume that the true time-to-event and the censoring times are independent given the covariates. We propose different ways to build survival forests when dependent censoring is suspected, by using an appropriate estimator of the survival function when aggregating the individual trees and/or by modifying the splitting rule. The appropriate estimator used in this paper is the copula-graphic estimator. We also propose a new method for building survival forests, called p-forest, that may be used not only when dependent censoring is suspected, but also as a new survival forest method in general. The results from a simulation study indicate that these modifications improve greatly the estimation of the survival function in situations of dependent censoring. A real data example illustrates how the proposed methods can be used to perform a sensitivity analysis.

Keywords: Survival data; copula-graphic; dependent-censoring; ensemble methods; random forest; right-censored data; sensitivity analysis; survival forest.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Analysis of Variance
  • Computer Simulation
  • Data Interpretation, Statistical
  • Humans
  • Liver Cirrhosis / mortality
  • Liver Cirrhosis / surgery
  • Randomized Controlled Trials as Topic / statistics & numerical data
  • Research Design
  • Survival Analysis*