Logistic-AFT location-scale mixture regression models with nonsusceptibility for left-truncated and general interval-censored data

Stat Med. 2013 Oct 30;32(24):4285-305. doi: 10.1002/sim.5845. Epub 2013 May 10.

Abstract

In conventional survival analysis there is an underlying assumption that all study subjects are susceptible to the event. In general, this assumption does not adequately hold when investigating the time to an event other than death. Owing to genetic and/or environmental etiology, study subjects may not be susceptible to the disease. Analyzing nonsusceptibility has become an important topic in biomedical, epidemiological, and sociological research, with recent statistical studies proposing several mixture models for right-censored data in regression analysis. In longitudinal studies, we often encounter left, interval, and right-censored data because of incomplete observations of the time endpoint, as well as possibly left-truncated data arising from the dissimilar entry ages of recruited healthy subjects. To analyze these kinds of incomplete data while accounting for nonsusceptibility and possible crossing hazards in the framework of mixture regression models, we utilize a logistic regression model to specify the probability of susceptibility, and a generalized gamma distribution, or a log-logistic distribution, in the accelerated failure time location-scale regression model to formulate the time to the event. Relative times of the conditional event time distribution for susceptible subjects are extended in the accelerated failure time location-scale submodel. We also construct graphical goodness-of-fit procedures on the basis of the Turnbull-Frydman estimator and newly proposed residuals. Simulation studies were conducted to demonstrate the validity of the proposed estimation procedure. The mixture regression models are illustrated with alcohol abuse data from the Taiwan Aboriginal Study Project and hypertriglyceridemia data from the Cardiovascular Disease Risk Factor Two-township Study in Taiwan.

Keywords: accelerated failure time model; crossing hazards; cure model; generalized gamma distribution; log-logistic distribution; logistic regression.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Age of Onset
  • Alcoholism / epidemiology
  • Computer Simulation
  • Data Interpretation, Statistical*
  • Disease Susceptibility*
  • Female
  • Humans
  • Logistic Models*
  • Male
  • Middle Aged
  • Native Hawaiian or Other Pacific Islander
  • Stroke / blood
  • Survival Analysis*
  • Taiwan
  • Triglycerides / blood
  • Young Adult

Substances

  • Triglycerides