Semantic Interpretation for Convolutional Neural Networks: What Makes a Cat a Cat?

Hao Xu; Yuntian Chen; Dongxiao Zhang

doi:10.1002/advs.202204723

Adv Sci (Weinh). 2022 Dec;9(35):e2204723. doi: 10.1002/advs.202204723. Epub 2022 Oct 10.

Authors

Hao Xu¹, Yuntian Chen², Dongxiao Zhang³,⁴

Affiliations

¹ BIC-ESAT, ERE, and SKLTCS, College of Engineering, Peking University, Beijing, 100871, P. R. China.
² Eastern Institute for Advanced Study, Yongriver Institute of Technology, Ningbo, Zhejiang, 315200, P. R. China.
³ National Center for Applied Mathematics Shenzhen (NCAMS), Southern University of Science and Technology, Shenzhen, Guangdong, 518055, P. R. China.
⁴ Department of Mathematics and Theories, Peng Cheng Laboratory, Shenzhen, Guangdong, 518000, P. R. China.

Abstract

The interpretability of deep neural networks has attracted increasing attention in recent years, and several methods have been created to interpret the "black box" model. However, fundamental limitations still slow progress in understanding these networks, especially in extracting an understandable semantic space. In this work, the framework of semantic explainable artificial intelligence (S-XAI) is introduced. It uses a sample compression method based on row-centered principal component analysis (PCA), as distinct from conventional column-centered PCA, to obtain common traits of samples from the convolutional neural network (CNN), and it extracts understandable semantic spaces on the basis of discovered semantically sensitive neurons and visualization techniques. A statistical interpretation of the semantic space is also provided, and the concept of semantic probability is proposed. The experimental results demonstrate that S-XAI is effective in providing a semantic interpretation for the CNN and has broad applications, including trustworthiness assessment and semantic sample searching.

Keywords: convolutional neural network; interpretable machine learning; semantic space; trustworthiness assessment.
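
The abstract contrasts row-centered PCA with conventional column-centered PCA but does not spell out the construction. The following is a minimal sketch, not the authors' implementation. It assumes that "row-centered" means subtracting each sample's own mean (rather than each feature's mean) and then compressing along the sample axis, so that many samples are reduced to a few component rows that each span the full feature dimension. The function names, the matrix layout (samples as rows, features as columns), and the random stand-in data are all hypothetical.

```python
import numpy as np


def column_centered_pca(X, k):
    """Conventional PCA: center each feature (column) and compress the features.

    X has shape (n_samples, n_features); the result has shape (n_samples, k).
    """
    Xc = X - X.mean(axis=0, keepdims=True)  # subtract per-feature means
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:k].T  # project every sample onto the top-k feature axes


def row_centered_pca(X, k):
    """Assumed row-centered variant: center each sample (row) and compress the samples.

    X has shape (n_samples, n_features); the result has shape (k, n_features),
    i.e. k compressed rows that summarize what the samples have in common.
    """
    Xr = X - X.mean(axis=1, keepdims=True)  # subtract per-sample means
    U, _, _ = np.linalg.svd(Xr, full_matrices=False)
    return U[:, :k].T @ Xr  # combine samples along the top-k sample-space axes


# Stand-in for flattened CNN feature maps: 20 samples, 50 features each.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 50))

features_compressed = column_centered_pca(X, k=3)  # 20 samples, 3 features
samples_compressed = row_centered_pca(X, k=3)      # 3 rows, 50 features
print(features_compressed.shape, samples_compressed.shape)
```

Under this assumption, the two variants differ only in the axis that is centered and the axis that is reduced: the column-centered form keeps every sample and shortens its feature vector, whereas the row-centered form keeps the full feature vector and shortens the list of samples.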
