Relative Performance Guarantees for Approximate Inference in Latent Dirichlet Allocation

Hierarchical probabilistic modeling of discrete data has emerged as a powerful tool for text analysis. Posterior inference in such models is intractable, and practitioners rely on approximate posterior inference methods such as variational inference or Gibbs sampling. There has been much research in designing better approximations, but there is yet little theoretical understanding of which of the available techniques are appropriate, and in which data analysis settings. In this paper we provide the beginnings of such understanding. We analyze the improvement that the recently proposed collapsed variational inference (CVB) provides over mean field variational inference (VB) in latent Dirichlet allocation. We prove that the difference in the tightness of the bound on the likelihood of a document decreases as O(k - 1) + log m/m, where k is the number of topics in the model and m is the number of words in a document. As a consequence, the advantage of CVB over VB is lost for long documents b...

Indraneel Mukherjee, David M. Blei

Approximate Posterior Inference | Information Technology | NIPS 2008 | Posterior Inference | Variational Inference |

Post Info

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	NIPS
Authors	Indraneel Mukherjee, David M. Blei
