Interpretable and Reliable Oral Cancer Classifier with Attention Mechanism and Expert Knowledge Embedding via Attention Map

Bofan Song; Chicheng Zhang; Sumsum Sunny; Dharma Raj Kc; Shaobai Li; Keerthi Gurushanth; Pramila Mendonca; Nirza Mukhia; Sanjana Patrick; Shubha Gurudath; Subhashini Raghavan; Imchen Tsusennaro; Shirley T Leivon; Trupti Kolur; Vivek Shetty; Vidya Bushan; Rohan Ramesh; Vijay Pillai; Petra Wilder-Smith; Amritha Suresh; Moni Abraham Kuriakose; Praveen Birur; Rongguang Liang

doi:10.3390/cancers15051421

Interpretable and Reliable Oral Cancer Classifier with Attention Mechanism and Expert Knowledge Embedding via Attention Map

Cancers (Basel). 2023 Feb 23;15(5):1421. doi: 10.3390/cancers15051421.

Authors

Bofan Song¹, Chicheng Zhang², Sumsum Sunny³, Dharma Raj Kc², Shaobai Li¹, Keerthi Gurushanth⁴, Pramila Mendonca⁵, Nirza Mukhia⁴, Sanjana Patrick⁶, Shubha Gurudath⁴, Subhashini Raghavan⁴, Imchen Tsusennaro⁷, Shirley T Leivon⁷, Trupti Kolur⁵, Vivek Shetty⁵, Vidya Bushan⁵, Rohan Ramesh⁷, Vijay Pillai⁵, Petra Wilder-Smith⁸, Amritha Suresh^{3

5}, Moni Abraham Kuriakose⁹, Praveen Birur^{4

6}, Rongguang Liang¹

Affiliations

¹ Wyant College of Optical Sciences, The University of Arizona, Tucson, AZ 85721, USA.
² Computer Science Department, The University of Arizona, Tucson, AZ 85721, USA.
³ Mazumdar Shaw Medical Centre, Bangalore 560099, India.
⁴ KLE Society Institute of Dental Sciences, Bangalore 560022, India.
⁵ Mazumdar Shaw Medical Foundation, Bangalore 560099, India.
⁶ Biocon Foundation, Bangalore 560100, India.
⁷ Christian Institute of Health Sciences and Research, Dimapur 797115, India.
⁸ Beckman Laser Institute & Medical Clinic, University of California, Irvine, CA 92617, USA.
⁹ Cochin Cancer Research Center, Kochi 683503, India.

Abstract

Convolutional neural networks have demonstrated excellent performance in oral cancer detection and classification. However, the end-to-end learning strategy makes CNNs hard to interpret, and it can be challenging to fully understand the decision-making procedure. Additionally, reliability is also a significant challenge for CNN based approaches. In this study, we proposed a neural network called the attention branch network (ABN), which combines the visual explanation and attention mechanisms to improve the recognition performance and interpret the decision-making simultaneously. We also embedded expert knowledge into the network by having human experts manually edit the attention maps for the attention mechanism. Our experiments have shown that ABN performs better than the original baseline network. By introducing the Squeeze-and-Excitation (SE) blocks to the network, the cross-validation accuracy increased further. Furthermore, we observed that some previously misclassified cases were correctly recognized after updating by manually editing the attention maps. The cross-validation accuracy increased from 0.846 to 0.875 with the ABN (Resnet18 as baseline), 0.877 with SE-ABN, and 0.903 after embedding expert knowledge. The proposed method provides an accurate, interpretable, and reliable oral cancer computer-aided diagnosis system through visual explanation, attention mechanisms, and expert knowledge embedding.

Keywords: attention branch network; attention map; attention mechanism; expert knowledge embedding; human-in-the-loop deep learning; visual explanation.

Abstract

Grants and funding