Application of deep learning for automated diagnosis and classification of hip dysplasia on plain radiographs

BMC Musculoskelet Disord. 2024 Feb 9;25(1):117. doi: 10.1186/s12891-024-07244-0.

Abstract

Background: Hip dysplasia is a condition where the acetabulum is too shallow to support the femoral head and is commonly considered a risk factor for hip osteoarthritis. The objective of this study was to develop a deep learning model to diagnose hip dysplasia from plain radiographs and classify dysplastic hips based on their severity.

Methods: We collected pelvic radiographs of 571 patients from two single-center cohorts and one multicenter cohort. The radiographs were split in half to create hip radiographs (n = 1022). One orthopaedic surgeon and one resident assessed the radiographs for hip dysplasia on either side. We used the center edge (CE) angle as the primary diagnostic criteria. Hips with a CE angle < 20°, 20° to 25°, and > 25° were labeled as dysplastic, borderline, and normal, respectively. The dysplastic hips were also classified with both Crowe and Hartofilakidis classification of dysplasia. The dataset was divided into train, validation, and test subsets using 80:10:10 split-ratio that were used to train two deep learning models to classify images into normal, borderline and (1) Crowe grade 1-4 or (2) Hartofilakidis grade 1-3. A pre-trained on Imagenet VGG16 convolutional neural network (CNN) was utilized by performing layer-wise fine-turning.

Results: Both models struggled with distinguishing between normal and borderline hips. However, achieved high accuracy (Model 1: 92.2% and Model 2: 83.3%) in distinguishing between normal/borderline vs. dysplastic hips. The overall accuracy of Model 1 was 68% and for Model 2 73.5%. Most misclassifications for the Crowe and Hartofilakidis classifications were +/- 1 class from the correct class.

Conclusions: This pilot study shows promising results that a deep learning model distinguish between normal and dysplastic hips with high accuracy. Future research and external validation are warranted regarding the ability of deep learning models to perform complex tasks such as identifying and classifying disorders using plain radiographs.

Level of evidence: Diagnostic level IV.

Keywords: Crowe classification; Deep learning model; Hartofilakidis classification; Hip dysplasia; Radiographs.

Publication types

  • Multicenter Study

MeSH terms

  • Acetabulum / diagnostic imaging
  • Acetabulum / surgery
  • Deep Learning*
  • Hip Dislocation* / diagnostic imaging
  • Hip Dislocation* / surgery
  • Hip Dislocation, Congenital* / diagnostic imaging
  • Hip Dislocation, Congenital* / surgery
  • Humans
  • Pilot Projects
  • Radiography
  • Retrospective Studies