Empirical Analysis of the Divergence of Gibbs Sampling Based Learning Algorithms for Restricted Boltzmann Machines

13 years 5 months ago

Download www.neuroinformatik.ruhr-uni-bochum.de

Abstract. Learning algorithms relying on Gibbs sampling based stochastic approximations of the log-likelihood gradient have become a common way to train Restricted Boltzmann Machines (RBMs). We study three of these methods, Contrastive Divergence (CD) and its refined variants Persistent CD (PCD) and Fast PCD (FPCD). As the approximations are biased, the maximum of the log-likelihood is not necessarily obtained. Recently, it has been shown that CD, PCD, and FPCD can even lead to a steady decrease of the log-likelihood during learning. Taking artificial data sets from the literature we study these divergence effects in more detail. Our results indicate that the log-likelihood seems to diverge especially if the target distribution is difficult to learn for the RBM. The decrease of the likelihood can not be detected by an increase of the reconstruction error, which has been proposed as a stopping criterion for CD learning. Weight-decay with a carefully chosen weight-decay-parameter can pre...

Asja Fischer, Christian Igel

Real-time Traffic

Boltzmann Machines | Contrastive Divergence | Gibbs Sampling | ICANN 2010 | Neural Networks |

claim paper

Added	09 Nov 2010
Updated	09 Nov 2010
Type	Conference
Year	2010
Where	ICANN
Authors	Asja Fischer, Christian Igel

Sciweavers

Empirical Analysis of the Divergence of Gibbs Sampling Based Learning Algorithms for Restricted Boltzmann Machines

Boltzmann Machines | Contrastive Divergence | Gibbs Sampling | ICANN 2010 | Neural Networks |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers