A preliminary deep learning study on automatic segmentation of contrast-enhanced bolus in videofluorography of swallowing

Yoshiko Ariji; Masakazu Gotoh; Motoki Fukuda; Satoshi Watanabe; Toru Nagao; Akitoshi Katsumata; Eiichiro Ariji

doi:10.1038/s41598-022-21530-8

Sci Rep. 2022 Nov 5;12(1):18754. doi: 10.1038/s41598-022-21530-8.

Authors

Yoshiko Ariji¹ ², Masakazu Gotoh¹, Motoki Fukuda¹, Satoshi Watanabe³, Toru Nagao³, Akitoshi Katsumata⁴, Eiichiro Ariji⁵

Affiliations

¹ Department of Oral and Maxillofacial Radiology, Aichi-Gakuin University School of Dentistry, 2-11 Suemori-dori, Chikusa-ku, Nagoya, 464-8651, Japan.
² Department of Oral Radiology, School of Dentistry, Osaka Dental University, Osaka, Japan.
³ Department of Maxillofacial Surgery, Aichi-Gakuin University School of Dentistry, Nagoya, Japan.
⁴ Department of Oral Radiology, Asahi University School of Dentistry, Mizuho, Japan.
⁵ Department of Oral and Maxillofacial Radiology, Aichi-Gakuin University School of Dentistry, 2-11 Suemori-dori, Chikusa-ku, Nagoya, 464-8651, Japan. ariji@dpc.agu.ac.jp.

Abstract

Although videofluorography (VFG) is an effective tool for evaluating swallowing function, accurate evaluation requires considerable time and effort. This study aimed to create a deep learning model for automated bolus segmentation on VFG images of patients with healthy swallowing and with dysphagia, and to assess its performance. VFG recordings of 72 swallows from 12 patients were converted into sequential static images at a rate of 15 images per second. In total, 3910 images were arbitrarily assigned to the training, validation, test 1, and test 2 datasets. For the training and validation datasets, images with the bolus areas colored were prepared alongside the original images. A U-Net neural network was trained for 500 epochs to create the model. The trained model was then applied to the test datasets, and the performance of automatic segmentation was calculated as the Jaccard index, Sørensen-Dice coefficient, and sensitivity. All performance values for the test 1 and test 2 datasets were high, exceeding 0.9. The deep learning segmentation method thus automatically segmented the bolus areas on VFG images with high performance. The model also allowed assessment of aspiration and laryngeal invasion.
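The three performance measures named in the abstract can be illustrated with a minimal sketch, assuming each predicted and ground-truth bolus area is available as a 2D binary mask (1 = bolus pixel, 0 = background). The function names, the toy masks, and the choice to return 1.0 when a denominator is zero are illustrative assumptions, not details taken from the study.

```python
from typing import Dict, Sequence, Tuple

# Assumed representation: a 2D binary mask, 1 = bolus pixel, 0 = background.
Mask = Sequence[Sequence[int]]


def confusion_counts(pred: Mask, truth: Mask) -> Tuple[int, int, int]:
    """Count true positives, false positives, and false negatives per pixel."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of rows")
    tp = fp = fn = 0
    for pred_row, truth_row in zip(pred, truth):
        if len(pred_row) != len(truth_row):
            raise ValueError("mask rows must have the same length")
        for p, t in zip(pred_row, truth_row):
            if p and t:
                tp += 1
            elif p and not t:
                fp += 1
            elif t and not p:
                fn += 1
    return tp, fp, fn


def _ratio(numerator: int, denominator: int) -> float:
    # Assumed convention: an empty denominator (nothing to find, nothing
    # predicted) is scored as perfect agreement.
    return 1.0 if denominator == 0 else numerator / denominator


def segmentation_metrics(pred: Mask, truth: Mask) -> Dict[str, float]:
    """Jaccard index, Sorensen-Dice coefficient, and sensitivity for one image."""
    tp, fp, fn = confusion_counts(pred, truth)
    return {
        "jaccard": _ratio(tp, tp + fp + fn),        # |A ∩ B| / |A ∪ B|
        "dice": _ratio(2 * tp, 2 * tp + fp + fn),   # 2|A ∩ B| / (|A| + |B|)
        "sensitivity": _ratio(tp, tp + fn),         # TP / (TP + FN)
    }


# Toy example: 2 overlapping pixels, 1 false positive, 1 false negative.
predicted = [[0, 1, 1],
             [0, 1, 0]]
ground_truth = [[0, 1, 0],
                [0, 1, 1]]
metrics = segmentation_metrics(predicted, ground_truth)
print(metrics)
```

For the toy masks the counts are TP = 2, FP = 1, FN = 1, giving a Jaccard index of 0.5 and a Dice coefficient and sensitivity of 2/3 each. Note that the Dice coefficient is never lower than the Jaccard index for the same pair of masks, so the two should be compared against separate thresholds.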

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Artificial Intelligence
Deep Learning*
Deglutition
Humans
Image Processing, Computer-Assisted / methods
Neural Networks, Computer