Semantic Interpretation for Convolutional Neural Networks: What Makes a Cat a Cat?

Adv Sci (Weinh). 2022 Dec;9(35):e2204723. doi: 10.1002/advs.202204723. Epub 2022 Oct 10.

Abstract

The interpretability of deep neural networks has attracted increasing attention in recent years, and several methods have been created to interpret the "black box" model. Fundamental limitations remain, however, that impede the pace of understanding the networks, especially the extraction of understandable semantic space. In this work, the framework of semantic explainable artificial intelligence (S-XAI) is introduced, which utilizes a sample compression method based on the distinctive row-centered principal component analysis (PCA) that is different from the conventional column-centered PCA to obtain common traits of samples from the convolutional neural network (CNN), and extracts understandable semantic spaces on the basis of discovered semantically sensitive neurons and visualization techniques. Statistical interpretation of the semantic space is also provided, and the concept of semantic probability is proposed. The experimental results demonstrate that S-XAI is effective in providing a semantic interpretation for the CNN, and offers broad usage, including trustworthiness assessment and semantic sample searching.

Keywords: convolutional neural network; interpretable machine learning; semantic space; trustworthiness assessment.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Neural Networks, Computer
  • Semantics