Using threshold regression to analyze survival data from complex surveys: With application to mortality linked NHANES III Phase II genetic data

Stat Med. 2018 Mar 30;37(7):1162-1177. doi: 10.1002/sim.7575. Epub 2017 Dec 18.

Abstract

The Cox proportional hazards (PH) model is a common statistical technique used for analyzing time-to-event data. The assumption of PH, however, is not always appropriate in real applications. In cases where the assumption is not tenable, threshold regression (TR) and other survival methods, which do not require the PH assumption, are available and widely used. These alternative methods generally assume that the study data constitute simple random samples. In particular, TR has not been studied in the setting of complex surveys that involve (1) differential selection probabilities of study subjects and (2) intracluster correlations induced by multistage cluster sampling. In this paper, we extend TR procedures to account for complex sampling designs. The pseudo-maximum likelihood estimation technique is applied to estimate the TR model parameters. Computationally efficient Taylor linearization variance estimators that consider both the intracluster correlation and the differential selection probabilities are developed. The proposed methods are evaluated by using simulation experiments with various complex designs and illustrated empirically by using mortality-linked Third National Health and Nutrition Examination Survey Phase II genetic data.

Keywords: cox proportional hazard; cure rate; intracluster correlation; pseudo-maximum likelihood estimation; stratified multistage sampling.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Cluster Analysis
  • Computer Simulation
  • Databases, Genetic
  • Humans
  • Likelihood Functions*
  • Mortality
  • Multivariate Analysis
  • Nutrition Surveys
  • Regression Analysis*
  • Survival Analysis