Probabilistic modeling of short survivability in patients with brain metastasis from lung cancer

Comput Methods Programs Biomed. 2015 May;119(3):142-62. doi: 10.1016/j.cmpb.2015.02.005. Epub 2015 Feb 21.

Abstract

The prediction of substantially short survivability in patients is extremely risky. In this study, we proposed a probabilistic model using Bayesian network (BN) to predict the short survivability of patients with brain metastasis from lung cancer. A nationwide cancer patient database from 1996 to 2010 in Taiwan was used. The cohort consisted of 438 patients with brain metastasis from lung cancer. We utilized synthetic minority over-sampling technique (SMOTE) to solve the imbalanced property embedded in the problem. The proposed BN was compared with three competitive models, namely, naive Bayes (NB), logistic regression (LR), and support vector machine (SVM). Statistical analysis showed that performances of BN, LR, NB, and SVM were statistically the same in terms of all indices with low sensitivity when these models were applied on an imbalanced data set. Results also showed that SMOTE can improve the performance of the four models in terms of sensitivity, while keeping high accuracy and specificity. Further, the proposed BN is more effective as compared with NB, LR, and SVM from two perspectives: the transparency and ability to show the relation of factors affecting brain metastasis from lung cancer; it allows decision makers to find the probability despite incomplete evidence and information; and the sensitivity of the proposed BN is the highest among all standard machine learning methods.

Keywords: Bayesian network; Brain metastasis; Lung cancer; Survivability.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Aged
  • Bayes Theorem
  • Brain Neoplasms / mortality
  • Brain Neoplasms / secondary*
  • Databases, Factual / statistics & numerical data
  • Female
  • Humans
  • Logistic Models
  • Lung Neoplasms*
  • Male
  • Middle Aged
  • Models, Biological
  • Models, Statistical*
  • Prognosis
  • Support Vector Machine
  • Survival Analysis
  • Taiwan / epidemiology