A robust variational autoencoder using beta divergence

Haleh Akrami; Anand A Joshi; Jian Li; Sergül Aydöre; Richard M Leahy

doi:10.1016/j.knosys.2021.107886

Knowl Based Syst. 2022 Feb 28:238:107886. doi: 10.1016/j.knosys.2021.107886. Epub 2021 Dec 10.

Authors

Haleh Akrami¹, Anand A Joshi¹, Jian Li² ³, Sergül Aydöre⁴, Richard M Leahy¹

Affiliations

¹ Signal and Image Processing Institute, University of Southern California, Los Angeles, CA, USA.
² Athinoula A. Martinos Center for Biomedical Imaging Massachusetts General Hospital, Harvard Medical School, Charlestown, MA, USA.
³ Center for Neurotechnology and Neurorecovery, Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA.
⁴ Amazon Web Services, New York, NY, USA.

Abstract

Outliers in training data can severely degrade the representations learned by deep learning methods, and therefore their performance, by disproportionately affecting the training process and leading to incorrect conclusions about the data. For example, anomaly detection using deep generative models is typically only possible when similar anomalies (or outliers) are not present in the training data. Here we focus on variational autoencoders (VAEs). While the VAE is a popular framework for anomaly detection tasks, we observe that the VAE is unable to detect outliers when the training data contains anomalies that have the same distribution as those in the test data. In this paper we address robustness to outliers in the training data in VAE settings using concepts from robust statistics. We propose a variational lower bound that leads to a robust VAE model that has the same computational complexity as the standard VAE and contains a single automatically adjusted tuning parameter to control the degree of robustness. We present mathematical formulations of robust variational autoencoders (RVAEs) for Bernoulli, Gaussian and categorical variables. The RVAE model is based on β-divergence rather than the standard Kullback-Leibler (KL) divergence. We demonstrate the performance of our proposed β-divergence-based autoencoder on a variety of image and categorical datasets, showing improved robustness to outliers both qualitatively and quantitatively. We also illustrate the use of our robust VAE for detection of lesions in brain images, formulated as an anomaly detection task. Finally, we suggest a method to tune the hyperparameter of the RVAE, which makes our model completely unsupervised.

Keywords: Outlier; RVAE; Robust anomaly detection; VAE; β divergence.
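
The abstract contrasts β-divergence with the standard KL divergence but gives no formula. As a minimal illustrative sketch, and not the RVAE objective from the paper, the code below evaluates the usual density-power form of the β-divergence for two discrete distributions and compares it with KL at a very small β, where the two should nearly coincide. The function names, the example distributions g and f, and the restriction to discrete distributions are assumptions made for illustration.

```python
import math


def kl_divergence(g, f):
    """KL(g || f) for two discrete distributions given as lists of probabilities."""
    return sum(gi * math.log(gi / fi) for gi, fi in zip(g, f) if gi > 0)


def beta_divergence(g, f, beta):
    """Density-power (beta) divergence between discrete distributions g and f.

    D_beta(g || f) = (1/beta) * sum(g^(1+beta))
                     - ((1+beta)/beta) * sum(g * f^beta)
                     + sum(f^(1+beta))

    Assumed form for illustration; requires beta > 0.
    """
    if beta <= 0:
        raise ValueError("beta must be positive")
    term_g = sum(gi ** (1 + beta) for gi in g) / beta
    cross = (1 + beta) / beta * sum(gi * fi ** beta for gi, fi in zip(g, f))
    term_f = sum(fi ** (1 + beta) for fi in f)
    return term_g - cross + term_f


if __name__ == "__main__":
    # Hypothetical example distributions, chosen only for illustration.
    g = [0.7, 0.2, 0.1]
    f = [0.5, 0.3, 0.2]
    print("KL divergence:       ", kl_divergence(g, f))
    print("beta-div (beta=1e-6):", beta_divergence(g, f, 1e-6))
    print("beta-div (beta=0.5): ", beta_divergence(g, f, 0.5))
```

In this form, the model probabilities enter through powers of f rather than through log f, which is the usual intuition for why samples that the model considers very unlikely have a bounded influence on the fit, whereas their log-likelihood contribution under KL is unbounded.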
