Time-to-event analysis with artificial neural networks: an integrated analytical and rule-based study for breast cancer

Neural Netw. 2008 Mar-Apr;21(2-3):414-26. doi: 10.1016/j.neunet.2007.12.034. Epub 2007 Dec 28.

Abstract

This paper presents an analysis of censored survival data for breast cancer-specific mortality and disease-free survival. The process has three stages: time-to-event modelling, risk stratification by predicted outcome, and model interpretation using rule extraction. Model selection was carried out using the benchmark linear model, Cox regression, but risk staging was derived both with Cox regression and with Partial Logistic Regression Artificial Neural Networks regularised with Automatic Relevance Determination (PLANN-ARD). The analysis compares the two approaches and shows the benefit of the neural network framework, especially for patients at high risk. The neural network also yields a smooth model of the hazard without the need for limiting assumptions of proportionality. The model predictions were verified using out-of-sample testing, with the mortality model also compared with two other prognostic models, TNG and the NPI rule model. Further verification was carried out by comparing marginal estimates of the predicted and actual cumulative hazards. It was also observed that doctors seem to treat mortality and disease-free models as equivalent, so a further analysis was performed to test whether this was the case. The analysis was extended with automatic rule generation using Orthogonal Search Rule Extraction (OSRE). This methodology translates analytical risk scores into the language of the clinical domain, enabling direct validation of the operation of the Cox or neural network model. This paper extends the existing OSRE methodology to data sets that include continuous-valued variables.
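The discrete-time formulation that partial logistic regression models such as PLANN build on can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes follow-up is divided into numbered intervals, that each patient contributes one training row per interval at risk (target 1 only in the interval where the event occurs), and that a censored patient contributes a target of 0 in the final observed interval. The function names `expand_person_period` and `survival_from_hazards` are hypothetical and chosen for illustration only.

```python
from itertools import accumulate
from operator import mul


def expand_person_period(last_interval, event):
    """Expand one patient into (interval, target) rows.

    last_interval: 1-based index of the interval in which follow-up ended.
    event: True if the event occurred in that interval, False if censored.
    Assumption: a censored patient still contributes a 0 target in the
    final observed interval.
    """
    # Every interval survived before the last one has target 0.
    rows = [(k, 0) for k in range(1, last_interval)]
    # The last interval has target 1 only if the event was observed.
    rows.append((last_interval, 1 if event else 0))
    return rows


def survival_from_hazards(hazards):
    """Convert per-interval conditional hazards h_k into survival.

    S(t_k) is the running product of (1 - h_l) for l = 1..k.
    """
    return list(accumulate((1.0 - h for h in hazards), mul))


if __name__ == "__main__":
    print(expand_person_period(3, True))
    print(expand_person_period(2, False))
    print(survival_from_hazards([0.1, 0.2, 0.05]))
```

In this setup the interval index becomes an extra input covariate, so a network trained on the expanded rows outputs a conditional hazard for each interval. This is one way a model of the hazard can be obtained without imposing proportionality across time.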

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms / mortality*
  • Breast Neoplasms / therapy*
  • Cohort Studies
  • Disease-Free Survival
  • Humans
  • Logistic Models
  • Models, Biological
  • Neoplasm Staging
  • Neural Networks, Computer*
  • Numerical Analysis, Computer-Assisted*
  • Pattern Recognition, Automated / methods*
  • Predictive Value of Tests
  • Proportional Hazards Models
  • Reproducibility of Results
  • Risk Assessment
  • Time Factors