Predicting rotator cuff tears using data mining and Bayesian likelihood ratios

PLoS One. 2014 Apr 14;9(4):e94917. doi: 10.1371/journal.pone.0094917. eCollection 2014.

Abstract

Objectives: Rotator cuff tear is a common cause of shoulder diseases. Correct diagnosis of rotator cuff tears can save patients from further invasive, costly and painful tests. This study used predictive data mining and Bayesian theory to improve the accuracy of diagnosing rotator cuff tears by clinical examination alone.

Methods: In this retrospective study, 169 patients who had a preliminary diagnosis of rotator cuff tear on the basis of clinical evaluation followed by confirmatory MRI between 2007 and 2011 were identified. MRI was used as a reference standard to classify rotator cuff tears. The predictor variable was the clinical assessment results, which consisted of 16 attributes. This study employed 2 data mining methods (ANN and the decision tree) and a statistical method (logistic regression) to classify the rotator cuff diagnosis into "tear" and "no tear" groups. Likelihood ratio and Bayesian theory were applied to estimate the probability of rotator cuff tears based on the results of the prediction models.

Results: Our proposed data mining procedures outperformed the classic statistical method. The correction rate, sensitivity, specificity and area under the ROC curve of predicting a rotator cuff tear were statistical better in the ANN and decision tree models compared to logistic regression. Based on likelihood ratios derived from our prediction models, Fagan's nomogram could be constructed to assess the probability of a patient who has a rotator cuff tear using a pretest probability and a prediction result (tear or no tear).

Conclusions: Our predictive data mining models, combined with likelihood ratios and Bayesian theory, appear to be good tools to classify rotator cuff tears as well as determine the probability of the presence of the disease to enhance diagnostic decision making for rotator cuff tears.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Aged
  • Aged, 80 and over
  • Bayes Theorem
  • Data Mining*
  • Demography
  • Female
  • Humans
  • Joint Diseases / diagnosis*
  • Joint Diseases / pathology
  • Likelihood Functions
  • Magnetic Resonance Imaging
  • Male
  • Middle Aged
  • Reference Standards
  • Reproducibility of Results
  • Rotator Cuff / pathology
  • Rotator Cuff Injuries*
  • Rupture
  • Young Adult

Grants and funding

This research was partially supported by National Science Council of Taiwan (ROC) under research grants NSC-100-2218-E-224-014. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. No additional external funding received for this study.