Data mining in bone marrow transplant records to identify patients with high odds of survival

IEEE J Biomed Health Inform. 2014 Jan;18(1):21-7. doi: 10.1109/JBHI.2013.2274733.

Abstract

Patients undergoing a bone marrow stem cell transplant (BMT) face various risk factors. Analyzing data from past transplants could enhance the understanding of the factors influencing success. Records up to 120 measurements per transplant procedure from 1751 patients undergoing BMT were collected (Shariati Hospital). Collaborative filtering techniques allowed the processing of highly sparse records with 22.3% missing values. Ten-fold cross-validation was used to evaluate the performance of various classification algorithms trained on predicting the survival status. Modest accuracy levels were obtained in predicting the survival status (AUC = 0.69). More importantly, however, operations that had the highest chances of success were shown to be identifiable with high accuracy, e.g., 92% or 97% when identifying 74 or 31 recipients, respectively. Identifying the patients with the highest chances of survival has direct application in the prioritization of resources and in donor matching. For patients where high-confidence prediction is not achieved, assigning a probability to their survival odds has potential applications in probabilistic decision support systems and in combination with other sources of information.

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Bayes Theorem
  • Bone Marrow Transplantation / mortality*
  • Bone Marrow Transplantation / statistics & numerical data
  • Child
  • Child, Preschool
  • Computational Biology / methods*
  • Data Mining / methods*
  • Female
  • Graft Survival
  • Humans
  • Male
  • Medical Informatics / methods*
  • Middle Aged
  • Principal Component Analysis
  • ROC Curve
  • Risk Factors
  • Young Adult