Weighted contrastive divergence

Neural Netw. 2019 Jun:114:147-156. doi: 10.1016/j.neunet.2018.09.013. Epub 2018 Oct 15.

Abstract

Learning algorithms for energy-based Boltzmann architectures that rely on gradient descent are in general computationally prohibitive, typically because computing the partition function involves an exponential number of terms. One therefore has to resort to approximation schemes to evaluate the gradient. This is the case for Restricted Boltzmann Machines (RBMs) and their learning algorithm, Contrastive Divergence (CD). It is well known that CD has a number of shortcomings and that its approximation to the gradient has several drawbacks. Overcoming these defects has been the basis of much research, and new algorithms have been devised, such as persistent CD. In this manuscript we propose a new algorithm that we call Weighted CD (WCD), built from small modifications of the negative phase in standard CD. However small these modifications may be, the experimental work reported in this paper suggests that WCD provides a significant improvement over standard CD and persistent CD at a small additional computational cost.
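To make the terms above concrete, the following is a minimal, illustrative sketch of a tiny binary RBM in plain Python. It shows why the exact partition function needs 2^n terms and how a one-step CD estimate of the weight gradient splits into a positive and a negative phase. The abstract does not specify how WCD modifies the negative phase; the `weighted=True` branch, which weights each reconstructed sample by its relative unnormalised probability within the batch, is only one plausible reading and is an assumption of this sketch. All function and parameter names are hypothetical.

```python
import itertools
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def hidden_probs(v, W, c):
    """P(h_j = 1 | v) for each hidden unit j."""
    return [sigmoid(c[j] + sum(v[i] * W[i][j] for i in range(len(v))))
            for j in range(len(c))]


def visible_probs(h, W, b):
    """P(v_i = 1 | h) for each visible unit i."""
    return [sigmoid(b[i] + sum(W[i][j] * h[j] for j in range(len(h))))
            for i in range(len(b))]


def sample(probs, rng):
    return [1 if rng.random() < p else 0 for p in probs]


def free_energy(v, W, b, c):
    """F(v) = -b.v - sum_j log(1 + exp(c_j + sum_i v_i W_ij))."""
    visible_term = sum(bi * vi for bi, vi in zip(b, v))
    hidden_term = sum(
        math.log1p(math.exp(c[j] + sum(v[i] * W[i][j] for i in range(len(v)))))
        for j in range(len(c)))
    return -visible_term - hidden_term


def partition_function(W, b, c):
    """Exact Z = sum_v exp(-F(v)); enumerates 2**n_visible states,
    so it is only feasible for very small models."""
    states = itertools.product((0, 1), repeat=len(b))
    return sum(math.exp(-free_energy(v, W, b, c)) for v in states)


def cd1_weight_gradient(batch, W, b, c, rng, weighted=False):
    """One-step CD estimate of d(log-likelihood)/dW: positive minus negative phase."""
    n_v, n_h = len(b), len(c)
    pos = [[0.0] * n_h for _ in range(n_v)]
    reconstructions = []
    for v0 in batch:
        ph0 = hidden_probs(v0, W, c)
        for i in range(n_v):
            for j in range(n_h):
                pos[i][j] += v0[i] * ph0[j] / len(batch)
        # One Gibbs step: sample h from the data, then reconstruct v.
        h0 = sample(ph0, rng)
        reconstructions.append(sample(visible_probs(h0, W, b), rng))

    if weighted:
        # ASSUMPTION (not stated in the abstract): weight each reconstruction
        # by exp(-F(v)) normalised over the batch, computed stably.
        scores = [-free_energy(v1, W, b, c) for v1 in reconstructions]
        top = max(scores)
        raw = [math.exp(s - top) for s in scores]
        weights = [r / sum(raw) for r in raw]
    else:
        # Standard CD: plain average over the batch.
        weights = [1.0 / len(reconstructions)] * len(reconstructions)

    neg = [[0.0] * n_h for _ in range(n_v)]
    for w, v1 in zip(weights, reconstructions):
        ph1 = hidden_probs(v1, W, c)
        for i in range(n_v):
            for j in range(n_h):
                neg[i][j] += w * v1[i] * ph1[j]

    return [[pos[i][j] - neg[i][j] for j in range(n_h)] for i in range(n_v)]
```

With 3 visible and 2 hidden units the enumeration covers only 8 visible states, but the count doubles with every added visible unit, which is why the exact gradient is replaced by sampled estimates such as the one above. Note that the weighted branch needs only the free energies of the samples already drawn, not Z itself, since the normalisation is over the batch.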

Keywords: Contrastive divergence; Neural networks; Restricted Boltzmann machine.

MeSH terms

  • Algorithms
  • Neural Networks, Computer*