Deep Unlearning via Randomized Conditionally Independent Hessians

Ronak Mehta; Sourav Pal; Vikas Singh; Sathya N Ravi

doi:10.1109/cvpr52688.2022.01017

Deep Unlearning via Randomized Conditionally Independent Hessians

Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2022 Jun:2022:10412-10421. doi: 10.1109/cvpr52688.2022.01017. Epub 2022 Sep 27.

Authors

Ronak Mehta¹, Sourav Pal¹, Vikas Singh¹, Sathya N Ravi²

Affiliations

¹ University of Wisconsin-Madison.
² University of Illinois at Chicago.

Abstract

Recent legislation has led to interest in machine unlearning, i.e., removing specific training samples from a predictive model as if they never existed in the training dataset. Unlearning may also be required due to corrupted/adversarial data or simply a user's updated privacy requirement. For models which require no training (k-NN), simply deleting the closest original sample can be effective. But this idea is inapplicable to models which learn richer representations. Recent ideas leveraging optimization-based updates scale poorly with the model dimension d, due to inverting the Hessian of the loss function. We use a variant of a new conditional independence coefficient, L-CODEC, to identify a subset of the model parameters with the most semantic overlap on an individual sample level. Our approach completely avoids the need to invert a (possibly) huge matrix. By utilizing a Markov blanket selection, we premise that L-CODEC is also suitable for deep unlearning, as well as other applications in vision. Compared to alternatives, L-CODEC makes approximate unlearning possible in settings that would otherwise be infeasible, including vision models used for face recognition, person re-identification and NLP models that may require unlearning samples identified for exclusion. Code is available at https://github.com/vsingh-group/LCODEC-deep-unlearning.

Grants and funding

RF1 AG062336/AG/NIA NIH HHS/United States