Using Machine Learning to Unravel the Value of Radiographic Features for the Classification of Bone Tumors

Biomed Res Int. 2021 Mar 11:2021:8811056. doi: 10.1155/2021/8811056. eCollection 2021.

Abstract

Objectives: To build and validate random forest (RF) models for the classification of bone tumors based on the conventional radiographic features of the lesion and patients' clinical characteristics, and identify the most essential features for the classification of bone tumors.

Materials and methods: In this retrospective study, 796 patients (benign bone tumors: 412 cases, malignant bone tumors: 215 cases, intermediate bone tumors: 169 cases) with pathologically confirmed bone tumors from Nanfang Hospital of Southern Medical University, Foshan Hospital of TCM, and University of Hong Kong-Shenzhen Hospital were enrolled. RF models were built to classify tumors as benign, malignant, or intermediate based on conventional radiographic features and potentially relevant clinical characteristics extracted by three musculoskeletal radiologists with ten years of experience. SHapley Additive exPlanations (SHAP) was used to identify the most essential features for the classification of bone tumors. The diagnostic performance of the RF models was quantified using receiver operating characteristic (ROC) curves.

Results: The features extracted by the three radiologists had a satisfactory agreement and the minimum intraclass correlation coefficient (ICC) was 0.761 (CI: 0.686-0.824, P < .001). The binary and tertiary models were built to classify tumors as benign, malignant, or intermediate based on the imaging and clinical features from 627 and 796 patients. The AUC of the binary (19 variables) and tertiary (22 variables) models were 0.97 and 0.94, respectively. The accuracy of binary and tertiary models were 94.71% and 82.77%, respectively. In descending order, the most important features influencing classification in the binary model were margin, cortex involvement, and the pattern of bone destruction, and the most important features in the tertiary model were margin, high-density components, and cortex involvement.

Conclusions: This study developed interpretable models to classify bone tumors with great performance. These should allow radiographers to identify imaging features that are important for the classification of bone tumors in the clinical setting.

MeSH terms

  • Adolescent
  • Adult
  • Bone Neoplasms / classification*
  • Bone Neoplasms / diagnostic imaging*
  • Child
  • Female
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged
  • Tomography, X-Ray Computed*