Learning Deep Boltzmann Machines using Adaptive MCMC

When modeling high-dimensional, richly structured data, the distribution defined by a Deep Boltzmann Machine (DBM) often has a rough energy landscape with many local minima separated by high energy barriers. The commonly used Gibbs sampler tends to get trapped in one local mode, which often results in unstable learning dynamics and leads to poor parameter estimates. In this paper, we concentrate on learning DBMs using adaptive MCMC algorithms. We first show a close connection between Fast PCD and adaptive MCMC. We then develop a Coupled Adaptive Simulated Tempering algorithm that can be used to better explore a highly multimodal energy landscape. Finally, we demonstrate that the proposed algorithm considerably improves parameter estimates, particularly when learning large-scale DBMs.
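To make the idea of tempering concrete, the following is a minimal sketch of plain simulated tempering on a one-dimensional double-well energy. It is not the paper's Coupled Adaptive Simulated Tempering algorithm and involves no DBM: the energy function, the temperature ladder `betas`, the step size, and the fixed (non-adaptive) unit weights on each temperature are all assumptions chosen for illustration. The chain moves in both the state `x` and a temperature index `k`, so that high-temperature levels can help it cross the barrier between the two modes.

```python
import numpy as np

def energy(x, barrier=8.0):
    """Toy double-well energy with minima at x = -1 and x = +1 (illustrative only)."""
    return barrier * (x**2 - 1.0) ** 2

def simulated_tempering(n_steps, betas, step=0.5, seed=0):
    """Sample (x, k) from p(x, k) proportional to exp(-betas[k] * energy(x)).

    All temperature levels carry equal weight here, which is a simplification;
    adaptive schemes tune these weights during the run.
    """
    rng = np.random.default_rng(seed)
    x, k = -1.0, len(betas) - 1  # start in the left well at the last ladder entry
    xs = np.empty(n_steps)
    ks = np.empty(n_steps, dtype=int)
    for t in range(n_steps):
        # Random-walk Metropolis update of x at the current inverse temperature.
        x_new = x + step * rng.normal()
        if np.log(rng.random()) < -betas[k] * (energy(x_new) - energy(x)):
            x = x_new
        # Propose a move to a neighbouring temperature; out-of-range proposals are rejected.
        k_new = k + rng.choice([-1, 1])
        if 0 <= k_new < len(betas):
            if np.log(rng.random()) < -(betas[k_new] - betas[k]) * energy(x):
                k = k_new
        xs[t], ks[t] = x, k
    return xs, ks

# Hypothetical ladder from a hot level (beta = 0.1) up to the target level (beta = 1.0).
betas = np.linspace(0.1, 1.0, 10)
xs, ks = simulated_tempering(5000, betas)
target_samples = xs[ks == len(betas) - 1]  # keep only samples drawn at beta = 1.0
```

Only the samples recorded at the target temperature are kept as draws from the distribution of interest; the other levels exist to help the chain move between modes.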

Ruslan Salakhutdinov

Adaptive MCMC | Energy Landscape | ICML 2010 | Machine Learning | Parameter Estimates

Added	09 Nov 2010
Updated	09 Nov 2010
Type	Conference
Year	2010
Where	ICML
Authors	Ruslan Salakhutdinov
