Variational Information Bottleneck for Unsupervised Clustering: Deep Gaussian Mixture Embedding

Entropy (Basel). 2020 Feb 13;22(2):213. doi: 10.3390/e22020213.

Abstract

In this paper, we develop an unsupervised generative clustering framework that combines the variational information bottleneck with the Gaussian mixture model. Specifically, we apply the variational information bottleneck method with the latent space modeled as a mixture of Gaussians. We derive a bound on the cost function of our model that generalizes the Evidence Lower Bound (ELBO) and provide a variational-inference-type algorithm for computing it. In the algorithm, the coders' mappings are parametrized using neural networks, and the bound is approximated by Markov sampling and optimized with stochastic gradient descent. Numerical results on real datasets are provided to support the efficiency of our method.

Keywords: Gaussian mixture model; clustering; information bottleneck; unsupervised learning.
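
The sampling-based approximation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a diagonal-Gaussian encoder output, a diagonal-covariance Gaussian mixture as the latent prior, and plain Monte Carlo sampling with the reparameterization trick to estimate the KL term that typically appears in information-bottleneck-style objectives. All function and parameter names (`log_diag_gaussian`, `log_gmm`, `mc_kl_to_gmm`, `n_samples`, `seed`) are hypothetical and chosen for illustration only.

```python
import numpy as np


def log_diag_gaussian(u, mean, log_var):
    """Log density of a diagonal Gaussian; broadcasts over leading axes of u."""
    d = u.shape[-1]
    sq = (u - mean) ** 2 / np.exp(log_var)
    return -0.5 * (d * np.log(2.0 * np.pi)
                   + np.sum(log_var, axis=-1)
                   + np.sum(sq, axis=-1))


def log_gmm(u, weights, means, log_vars):
    """Log density of a diagonal-covariance Gaussian mixture at each row of u."""
    # comp has shape (n_samples, n_components)
    comp = log_diag_gaussian(u[:, None, :], means, log_vars)
    a = np.log(weights) + comp
    # log-sum-exp over components, stabilized by subtracting the row maximum
    m = a.max(axis=1, keepdims=True)
    return (m + np.log(np.exp(a - m).sum(axis=1, keepdims=True)))[:, 0]


def mc_kl_to_gmm(mu, log_var, weights, means, log_vars, n_samples=1000, seed=0):
    """Monte Carlo estimate of KL(q || p), with q = N(mu, diag(exp(log_var)))
    and p the Gaussian mixture (assumed setup, not taken from the paper)."""
    rng = np.random.default_rng(seed)
    eps = rng.standard_normal((n_samples, mu.shape[0]))
    u = mu + np.exp(0.5 * log_var) * eps  # reparameterized samples from q
    log_q = log_diag_gaussian(u, mu, log_var)
    log_p = log_gmm(u, weights, means, log_vars)
    return float(np.mean(log_q - log_p))


# Example: a 2-D encoder output scored against a two-component mixture prior.
mu = np.zeros(2)
log_var = np.zeros(2)
weights = np.array([0.5, 0.5])
means = np.array([[0.0, 0.0], [3.0, 3.0]])
log_vars = np.zeros((2, 2))
kl_estimate = mc_kl_to_gmm(mu, log_var, weights, means, log_vars)
```

In a training loop, an estimate of this kind would usually be computed with an automatic-differentiation framework and a small number of samples per data point, so that stochastic gradient descent can update the encoder and mixture parameters jointly.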