A Test Statistic Estimation-Based Approach for Establishing Self-Interpretable CNN-Based Binary Classifiers

Sourya Sengupta; Mark A Anastasio

doi:10.1109/TMI.2023.3348699

A Test Statistic Estimation-Based Approach for Establishing Self-Interpretable CNN-Based Binary Classifiers

IEEE Trans Med Imaging. 2024 May;43(5):1753-1765. doi: 10.1109/TMI.2023.3348699. Epub 2024 May 2.

Authors

Sourya Sengupta, Mark A Anastasio

PMID: 38163307
PMCID: PMC11065575 (available on 2025-05-02)
DOI: 10.1109/TMI.2023.3348699

Abstract

Interpretability is highly desired for deep neural network-based classifiers, especially when addressing high-stake decisions in medical imaging. Commonly used post-hoc interpretability methods have the limitation that they can produce plausible but different interpretations of a given model, leading to ambiguity about which one to choose. To address this problem, a novel decision-theory-inspired approach is investigated to establish a self-interpretable model, given a pre-trained deep binary black-box medical image classifier. This approach involves utilizing a self-interpretable encoder-decoder model in conjunction with a single-layer fully connected network with unity weights. The model is trained to estimate the test statistic of the given trained black-box deep binary classifier to maintain a similar accuracy. The decoder output image, referred to as an equivalency map, is an image that represents a transformed version of the to-be-classified image that, when processed by the fixed fully connected layer, produces the same test statistic value as the original classifier. The equivalency map provides a visualization of the transformed image features that directly contribute to the test statistic value and, moreover, permits quantification of their relative contributions. Unlike the traditional post-hoc interpretability methods, the proposed method is self-interpretable, quantitative. Detailed quantitative and qualitative analyses have been performed with three different medical image binary classification tasks.

Publication types

Research Support, Non-U.S. Gov't
Research Support, N.I.H., Extramural

MeSH terms

Algorithms
Deep Learning
Humans
Image Interpretation, Computer-Assisted / methods
Image Processing, Computer-Assisted* / methods
Neural Networks, Computer*

Grants and funding

P41 EB031772/EB/NIBIB NIH HHS/United States