Collective dynamics of repeated inference in variational autoencoder rapidly find cluster structure

Yoshihiro Nagano; Ryo Karakida; Masato Okada

doi:10.1038/s41598-020-72593-4

Collective dynamics of repeated inference in variational autoencoder rapidly find cluster structure

Sci Rep. 2020 Sep 29;10(1):16001. doi: 10.1038/s41598-020-72593-4.

Authors

Yoshihiro Nagano^{1

2}, Ryo Karakida³, Masato Okada^{4

5}

Affiliations

¹ Department of Complexity Science and Engineering, The University of Tokyo, Chiba, 277-8561, Japan.
² Research Fellow of the Japan Society for the Promotion of Science, Tokyo, 102-0083, Japan.
³ Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo, 135-0064, Japan.
⁴ Department of Complexity Science and Engineering, The University of Tokyo, Chiba, 277-8561, Japan. okada@edu.k.u-tokyo.ac.jp.
⁵ Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo, 135-0064, Japan. okada@edu.k.u-tokyo.ac.jp.

Abstract

Deep neural networks are good at extracting low-dimensional subspaces (latent spaces) that represent the essential features inside a high-dimensional dataset. Deep generative models represented by variational autoencoders (VAEs) can generate and infer high-quality datasets, such as images. In particular, VAEs can eliminate the noise contained in an image by repeating the mapping between latent and data space. To clarify the mechanism of such denoising, we numerically analyzed how the activity pattern of trained networks changes in the latent space during inference. We considered the time development of the activity pattern for specific data as one trajectory in the latent space and investigated the collective behavior of these inference trajectories for many data. Our study revealed that when a cluster structure exists in the dataset, the trajectory rapidly approaches the center of the cluster. This behavior was qualitatively consistent with the concept retrieval reported in associative memory models. Additionally, the larger the noise contained in the data, the closer the trajectory was to a more global cluster. It was demonstrated that by increasing the number of the latent variables, the trend of the approach a cluster center can be enhanced, and the generalization ability of the VAE can be improved.

Publication types

Research Support, Non-U.S. Gov't