Mixture survival trees for cancer risk classification

Lifetime Data Anal. 2022 Jul;28(3):356-379. doi: 10.1007/s10985-022-09552-w. Epub 2022 Apr 29.

Abstract

In oncology studies, it is important to understand and characterize disease heterogeneity among patients so that patients can be classified into different risk groups and one can identify high-risk patients at the right time. This information can then be used to identify a more homogeneous patient population for developing precision medicine. In this paper, we propose a mixture survival tree approach for direct risk classification. We assume that the patients can be classified into a pre-specified number of risk groups, where each group has distinct survival profile. Our proposed tree-based methods are devised to estimate latent group membership using an EM algorithm. The observed data log-likelihood function is used as the splitting criterion in recursive partitioning. The finite sample performance is evaluated by extensive simulation studies and the proposed method is illustrated by a case study in breast cancer.

Keywords: Censoring; Latent model; Mixture distribution; Risk classification; Tree-based method.

MeSH terms

  • Algorithms*
  • Computer Simulation
  • Humans
  • Likelihood Functions
  • Neoplasms*
  • Research Design