An integrated AI model to improve diagnostic accuracy of ultrasound and output known risk features in suspicious thyroid nodules

Eur Radiol. 2022 Mar;32(3):2120-2129. doi: 10.1007/s00330-021-08298-7. Epub 2021 Oct 18.

Abstract

Objectives: From the viewpoint of ultrasound (US) physicians, an ideal thyroid US computer-assisted diagnostic (CAD) system for thyroid cancer should perform well in suspicious thyroid nodules with atypical risk features and be able to output explainable results. This study aims to develop an explainable US CAD model for suspicious thyroid nodules.

Methods: A total of 2992 solid or almost-solid thyroid nodules were analyzed retrospectively. All nodules had pathological results (1070 malignancies and 1992 benignities) confirmed by ultrasound-guided fine-needle aspiration cytology and histopathology after thyroidectomy. A deep learning model (ResNet50) and a multiple risk features learning ensemble model (XGBoost) were used to train the US images of 2794 thyroid nodules. Then, an integrated AI model was generated by combining both models. The diagnostic accuracies of the three AI models (ResNet50, XGBoost, and the integrated model) were predicted in a testing set including 198 thyroid nodules and compared to the diagnostic efficacy of five ultrasonographers.

Results: The accuracy of the integrated model was 76.77%, while the mean accuracy of the ultrasonographers was 68.38%. Of the risk features, microcalcifications showed the highest contribution to the diagnosis of malignant nodules.

Conclusions: The integrated AI model in our study can improve the diagnostic accuracy of suspicious thyroid nodules and output the known risk features simultaneously, thus aiding in training young ultrasonographers by linking the explainable results to their clinical experience and advancing the acceptance of AI diagnosis for thyroid cancer in clinical practice.

Key points: • We developed an artificial intelligence (AI) diagnosis model based on both deep learning and multiple risk feature ensemble learning methods. • The AI diagnosis model showed higher diagnostic accuracy for suspicious thyroid nodules than ultrasonographers. • The AI diagnosis model showed partial explainability by outputting the known risk features, thus aiding young ultrasonic doctors in increasing the diagnostic level for thyroid cancer.

Keywords: Artificial intelligence; Deep learning; Thyroid cancer; Thyroid nodules; Ultrasound.

MeSH terms

  • Artificial Intelligence
  • Humans
  • Retrospective Studies
  • Sensitivity and Specificity
  • Thyroid Neoplasms* / diagnostic imaging
  • Thyroid Nodule* / diagnostic imaging
  • Ultrasonography