Multi-Task Model for Esophageal Lesion Analysis Using Endoscopic Images: Classification with Image Retrieval and Segmentation with Attention

Xiaoyuan Yu; Suigu Tang; Chak Fong Cheang; Hon Ho Yu; I Cheong Choi

doi:10.3390/s22010283

Multi-Task Model for Esophageal Lesion Analysis Using Endoscopic Images: Classification with Image Retrieval and Segmentation with Attention

Sensors (Basel). 2021 Dec 31;22(1):283. doi: 10.3390/s22010283.

Authors

Xiaoyuan Yu¹, Suigu Tang¹, Chak Fong Cheang¹, Hon Ho Yu², I Cheong Choi²

Affiliations

¹ Faculty of Information Technology, Macau University of Science and Technology, Taipa, Macau.
² Kiang Wu Hospital, Santo António, Macau.

Abstract

The automatic analysis of endoscopic images to assist endoscopists in accurately identifying the types and locations of esophageal lesions remains a challenge. In this paper, we propose a novel multi-task deep learning model for automatic diagnosis, which does not simply replace the role of endoscopists in decision making, because endoscopists are expected to correct the false results predicted by the diagnosis system if more supporting information is provided. In order to help endoscopists improve the diagnosis accuracy in identifying the types of lesions, an image retrieval module is added in the classification task to provide an additional confidence level of the predicted types of esophageal lesions. In addition, a mutual attention module is added in the segmentation task to improve its performance in determining the locations of esophageal lesions. The proposed model is evaluated and compared with other deep learning models using a dataset of 1003 endoscopic images, including 290 esophageal cancer, 473 esophagitis, and 240 normal. The experimental results show the promising performance of our model with a high accuracy of 96.76% for the classification and a Dice coefficient of 82.47% for the segmentation. Consequently, the proposed multi-task deep learning model can be an effective tool to help endoscopists in judging esophageal lesions.

Keywords: classification; esophageal endoscopic images; image retrieval; multi-task; segmentation.

MeSH terms

Attention
Endoscopy
Esophageal Neoplasms*
Humans
Image Processing, Computer-Assisted

Grants and funding

0023/2018/AFJ/Science and Technology Development Fund, Macau SAR