Combining deep learning and machine learning for the automatic identification of hip prosthesis failure: Development, validation and explainability analysis

Int J Med Inform. 2023 Aug:176:105095. doi: 10.1016/j.ijmedinf.2023.105095. Epub 2023 May 18.

Abstract

Aim: Revision hip arthroplasty has a less favorable outcome than primary total hip arthroplasty and an understanding of the timing of total hip arthroplasty failure may be helpful. The aim of this study is to develop a combined deep learning (DL) and machine learning (ML) approach to automatically detect hip prosthetic failure from conventional plain radiographs.

Methods: Two cohorts of patients (of 280 and 352 patients) were included in the study, for model development and validation, respectively. The analysis was based on one antero-posterior and one lateral radiographic view obtained from each patient during routine post-surgery follow-up. After pre-processing, three images were obtained: the original image, the acetabulum image and the stem image. These images were analyzed through convolutional neural networks aiming to predict prosthesis failure. Deep features of the three images were extracted for each model and two feature-based pipelines were developed: one utilizing only the features of the original image (original image pipeline) and the other concatenating the features of the three images (3-image pipeline). The obtained features were either used directly or reduced through principal component analysis. Both support vector machine (SVM) and random forest (RF) classifiers were considered for each pipeline.

Results: The SVM applied to the 3-image pipeline provided the best performance, with an accuracy of 0.958 ± 0.006 in the internal validation and an F1-score of 0.874 in the external validation set. The explainability analysis, besides identifying the features of the complete original images as the major contributor, highlighted the role of the acetabulum and stem images on the prediction.

Conclusions: This study demonstrated the potentialities of the developed DL-ML procedure based on plain radiographs in the detection of the failure of the hip prosthesis.

Keywords: Artificial Intelligence; Feature extraction; Hip Replacement; Image classification; PCA algorithm.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arthroplasty, Replacement, Hip*
  • Deep Learning*
  • Hip Prosthesis*
  • Humans
  • Machine Learning
  • Prosthesis Failure