False Negative Rates in Benign Thyroid Nodule Diagnosis: Machine Learning for Detecting Malignancy

J Surg Res. 2021 Dec:268:562-569. doi: 10.1016/j.jss.2021.06.076. Epub 2021 Aug 28.

Abstract

Background: Thyroid nodules are common; up to 67% of adults will show nodules on high-quality ultrasound, and 95% of these nodules are benign. FNA cytology is a crucial step in determining the risk of malignancy, and a false negative diagnosis at this stage delays cancer treatment. The purpose of this study is to develop a predictive model using machine learning which can identify false negative FNA results based on less-invasive clinical data.

Materials and methods: We conducted a retrospective medical record review at one academic and one community center. Inclusion criteria were thyroid nodules evaluated by ultrasound and FNA with a Bethesda II (benign) result or malignancy detected on pathology or FNA. Linear, non-linear, and ensemble models were generated with scikit-learn using 10-fold cross validation with repetition and compared with AUROC. The classification task was the prediction of malignancy using information acquired from less-invasive ultrasound and FNA.

Results: A total of 604 subjects met inclusion criteria; 38 were diagnosed with malignancy. Of all algorithms tested, a Random Forest method achieved the best AUROC (0.64) in separating benign and malignant nodules, though the improvement over other tested algorithms was not statistically significant.

Conclusions: A Random Forest model performed better than random chance using readily available data obtained via standard evaluation of thyroid nodules. The diagnostic probability threshold of this model can be varied to minimize false positives at the cost of increasing the number of false negatives. Future studies will prospectively evaluate the model's performance.

Keywords: Bethesda II; False negative; Machine learning; Random forest; Thyroid nodules.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Biopsy, Fine-Needle
  • Humans
  • Machine Learning
  • Retrospective Studies
  • Thyroid Neoplasms* / diagnostic imaging
  • Thyroid Neoplasms* / pathology
  • Thyroid Nodule* / diagnostic imaging
  • Thyroid Nodule* / pathology