Diagnosis of pharyngeal cancer on endoscopic video images by Mask region-based convolutional neural network

Dig Endosc. 2021 May;33(4):569-576. doi: 10.1111/den.13800. Epub 2020 Sep 16.

Abstract

Objectives: We aimed to develop an artificial intelligence (AI) system for the real-time diagnosis of pharyngeal cancers.

Methods: Endoscopic video images and still images of pharyngeal cancer treated in our facility were collected. A total of 4559 images of pathologically proven pharyngeal cancer (1243 using white light imaging and 3316 using narrow-band imaging/blue laser imaging) from 276 patients were used as a training dataset. The AI system used a convolutional neural network (CNN) model typical of the type used to analyze visual imagery. Supervised learning was used to train the CNN. The AI system was evaluated using an independent validation dataset of 25 video images of pharyngeal cancer and 36 video images of normal pharynx taken at our hospital.

Results: The AI system diagnosed 23/25 (92%) pharyngeal cancers as cancers and 17/36 (47%) non-cancers as non-cancers. The transaction speed of the AI system was 0.03 s per image, which meets the required speed for real-time diagnosis. The sensitivity, specificity, and accuracy for the detection of cancer were 92%, 47%, and 66% respectively.

Conclusions: Our single-institution study showed that our AI system for diagnosing cancers of the pharyngeal region had promising performance with high sensitivity and acceptable specificity. Further training and improvement of the system are required with a larger dataset including multiple centers.

Keywords: artificial intelligence; convolutional neural network; pharynx; real-time diagnosis; squamous cell cancer.

MeSH terms

  • Artificial Intelligence*
  • Endoscopy
  • Humans
  • Narrow Band Imaging
  • Neural Networks, Computer
  • Pharyngeal Neoplasms* / diagnostic imaging