Efficient Prediction of Missed Clinical Appointment Using Machine Learning

Comput Math Methods Med. 2021 Oct 22:2021:2376391. doi: 10.1155/2021/2376391. eCollection 2021.

Abstract

Public health and its related facilities are crucial for thriving cities and societies. The optimum utilization of health resources saves money and time, but above all, it saves precious lives. It has become even more evident in the present as the pandemic has overstretched the existing medical resources. Specific to patient appointment scheduling, the casual attitude of missing medical appointments (no-show-ups) may cause severe damage to a patient's health. In this paper, with the help of machine learning, we analyze six million plus patient appointment records to predict a patient's behaviors/characteristics by using ten different machine learning algorithms. For this purpose, we first extracted meaningful features from raw data using data cleaning. We applied Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling Method (Adasyn), and random undersampling (RUS) to balance our data. After balancing, we applied ten different machine learning algorithms, namely, random forest classifier, decision tree, logistic regression, XG Boost, gradient boosting, Adaboost Classifier, Naive Bayes, stochastic gradient descent, multilayer perceptron, and Support Vector Machine. We analyzed these results with the help of six different metrics, i.e., recall, accuracy, precision, F1-score, area under the curve, and mean square error. Our study has achieved 94% recall, 86% accuracy, 83% precision, 87% F1-score, 92% area under the curve, and 0.106 minimum mean square error. Effectiveness of presented data cleaning and feature selection is confirmed by better results in all training algorithms. Notably, recall is greater than 75%, accuracy is greater than 73%, F1-score is more significant than 75%, MSE is lesser than 0.26, and AUC is greater than 74%. The research shows that instead of individual features, combining different features helps make better predictions of a patient's appointment status.

Publication types

  • Retracted Publication

MeSH terms

  • Algorithms*
  • Appointments and Schedules*
  • Area Under Curve
  • Bayes Theorem
  • Computational Biology
  • Data Interpretation, Statistical
  • Databases, Factual
  • Decision Trees
  • Humans
  • Logistic Models
  • Machine Learning*
  • Neural Networks, Computer
  • No-Show Patients / statistics & numerical data*
  • Stochastic Processes
  • Support Vector Machine