Model agnostic generation of counterfactual explanations for molecules

Geemi P Wellawatte; Aditi Seshadri; Andrew D White

doi:10.1039/d1sc05259d

Chem Sci. 2022 Feb 16;13(13):3697-3705. doi: 10.1039/d1sc05259d. eCollection 2022 Mar 30.

Authors

Geemi P Wellawatte¹, Aditi Seshadri², Andrew D White²

Affiliations

¹ Department of Chemistry, University of Rochester, Rochester, NY, USA.
² Department of Chemical Engineering, University of Rochester, Rochester, NY, USA; andrew.white@rochester.edu.

Abstract

An outstanding challenge in deep learning in chemistry is its lack of interpretability. The inability to explain why a neural network makes a prediction is a major barrier to the deployment of AI models. This not only dissuades chemists from using deep learning predictions, but has also led to neural networks learning spurious correlations that are difficult to notice. Counterfactuals are a category of explanations that provide a rationale behind a model prediction and have desirable properties, such as providing insight into chemical structure. Yet counterfactuals have previously been limited to specific model architectures or have required reinforcement learning as a separate process. In this work, we show a universal model-agnostic approach that can explain any black-box model prediction. We demonstrate this method on random forest models, sequence models, and graph neural networks in both classification and regression.
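
To make the notion of a model-agnostic counterfactual concrete, the following is a minimal sketch and not the authors' implementation. It assumes a pool of candidate molecules is already available and treats the model purely as a callable, returning the candidate most similar to the base molecule whose predicted label differs. The names `nearest_counterfactual`, `bigram_similarity`, and `toy_model` are hypothetical, and the character-bigram similarity is only a toy stand-in for a real molecular similarity measure.

```python
from typing import Callable, Hashable, Iterable, Optional, Tuple


def bigram_similarity(a: str, b: str) -> float:
    """Jaccard similarity over character bigrams.

    Assumption: a toy stand-in for a real molecular similarity measure.
    """
    grams_a = {a[i:i + 2] for i in range(len(a) - 1)}
    grams_b = {b[i:i + 2] for i in range(len(b) - 1)}
    union = grams_a | grams_b
    if not union:
        return 1.0 if a == b else 0.0
    return len(grams_a & grams_b) / len(union)


def nearest_counterfactual(
    model: Callable[[str], Hashable],
    base: str,
    candidates: Iterable[str],
    similarity: Callable[[str, str], float] = bigram_similarity,
) -> Optional[Tuple[str, float]]:
    """Return (candidate, similarity) for the most similar candidate whose
    predicted label differs from the base prediction, or None if none differs.

    The model is only ever called, never inspected, so any black box works.
    """
    base_label = model(base)
    best: Optional[Tuple[str, float]] = None
    for cand in candidates:
        # Skip the base itself and anything the model labels the same way.
        if cand == base or model(cand) == base_label:
            continue
        score = similarity(base, cand)
        if best is None or score > best[1]:
            best = (cand, score)
    return best


def toy_model(smiles: str) -> int:
    """Hypothetical black-box classifier: 1 if the string contains 'N'."""
    return int("N" in smiles)


base = "CCO"
candidates = ["CCN", "CCC", "CCCO", "NCCCCN"]
result = nearest_counterfactual(toy_model, base, candidates)
no_result = nearest_counterfactual(toy_model, base, ["CCC", "CCCO"])
```

Because the search only calls `model`, the same function could wrap a random forest, a sequence model, or a graph neural network. For regression, the label comparison would need to be replaced by a threshold on the change in the predicted value, which is left out of this sketch.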

Grants and funding

R35 GM137966/GM/NIGMS NIH HHS/United States