Representational power of restricted boltzmann machines and deep belief networks

Nicolas Le Roux; Yoshua Bengio

doi:10.1162/neco.2008.04-07-510

Representational power of restricted boltzmann machines and deep belief networks

Neural Comput. 2008 Jun;20(6):1631-49. doi: 10.1162/neco.2008.04-07-510.

Authors

Nicolas Le Roux¹, Yoshua Bengio

Affiliation

¹ Département Informatique et Recherche Opérationnelle, Université de Montréal, Montréal, Québec, H3C 3J7, Canada. lerouxni@iro.umontreal.ca

PMID: 18254699
DOI: 10.1162/neco.2008.04-07-510

Abstract

Deep belief networks (DBN) are generative neural network models with many layers of hidden explanatory factors, recently introduced by Hinton, Osindero, and Teh (2006) along with a greedy layer-wise unsupervised learning algorithm. The building block of a DBN is a probabilistic model called a restricted Boltzmann machine (RBM), used to represent one layer of the model. Restricted Boltzmann machines are interesting because inference is easy in them and because they have been successfully used as building blocks for training deeper models. We first prove that adding hidden units yields strictly improved modeling power, while a second theorem shows that RBMs are universal approximators of discrete distributions. We then study the question of whether DBNs with more layers are strictly more powerful in terms of representational power. This suggests a new and less greedy criterion for training RBMs within DBNs.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms*
Animals
Computer Simulation
Humans
Learning / physiology*
Models, Statistical*
Neural Networks, Computer*
Signal Processing, Computer-Assisted