A Video Transformer Network for Thyroid Cancer Detection on Hyperspectral Histologic Images

Proc SPIE Int Soc Opt Eng. 2023 Feb:12471:1247107. doi: 10.1117/12.2654851. Epub 2023 Apr 6.

Abstract

Hyperspectral imaging is a label-free and non-invasive imaging modality that seeks to capture images in different wavelengths. In this study, we used a vision transformer that was pre-trained from video data to detect thyroid cancer on hyperspectral images. We built a dataset of 49 whole slide hyperspectral images (WS-HSI) of thyroid cancer. To improve training, we introduced 5 new data augmentation methods that transform spectra. We achieved an F-1 score of 88.1% and an accuracy of 89.64% on our test dataset. The transformer network and the whole slide hyperspectral imaging technique can have many applications in digital pathology.

Keywords: Hyperspectral imaging; deep learning; image classification; thyroid cancer; transformer attention-based neural networks; whole-slide imaging.