Marginal Contribution Feature Importance - an Axiomatic Approach for Explaining Data

Amnon Catav; Boyang Fu; Yazeed Zoabi; Ahuva Weiss-Meilik; Noam Shomron; Jason Ernst; Sriram Sankararaman; Ran Gilad-Bachrach

Proc Mach Learn Res. 2021 Jul:139:1324-1335.

Authors

Amnon Catav¹, Boyang Fu², Yazeed Zoabi³, Ahuva Weiss-Meilik⁴, Noam Shomron³, Jason Ernst²,⁵,⁶, Sriram Sankararaman²,⁵,⁷, Ran Gilad-Bachrach⁸

Affiliations

¹ School of Computer Science, Tel-Aviv University, Tel-Aviv, Israel.
² Computer Science Department, University of California, Los Angeles, USA.
³ Faculty of Medicine, Tel-Aviv University, Tel-Aviv, Israel.
⁴ I-Medata AI Center, Tel Aviv Sourasky Medical Center, Tel-Aviv, Israel.
⁵ Department of Computational Medicine, University of California, Los Angeles, USA.
⁶ Department of Biological Chemistry, University of California, Los Angeles, USA.
⁷ Department of Human Genetics, University of California, Los Angeles, USA.
⁸ Department of Biomedical Engineering, Tel-Aviv University, Tel-Aviv, Israel.

PMID: 34568830
PMCID: PMC8460841

Abstract

In recent years, methods have been proposed for assigning feature importance scores to measure the contribution of individual features. While in some cases the goal is to understand a specific model, in many cases the goal is to understand the contribution of certain properties (features) to a real-world phenomenon. Thus, a distinction has been made between feature importance scores that explain a model and scores that explain the data. When explaining the data, machine learning models are used as proxies in settings where conducting many real-world experiments is expensive or prohibited. While existing feature importance scores show great success in explaining models, we demonstrate their limitations when explaining the data, especially in the presence of correlations between features. Therefore, we develop a set of axioms to capture properties expected from a feature importance score when explaining data and prove that there exists only one score that satisfies all of them, the Marginal Contribution Feature Importance (MCI). We analyze the theoretical properties of this score function and demonstrate its merits empirically.
