A comparison of Bayesian estimators for unsupervised Hidden Markov Model POS taggers

14 years 4 months ago

Download aclweb.org

There is growing interest in applying Bayesian techniques to NLP problems. There are a number of different estimators for Bayesian models, and it is useful to know what kinds of tasks each does well on. This paper compares a variety of different Bayesian estimators for Hidden Markov Model POS taggers with various numbers of hidden states on data sets of different sizes. Recent papers have given contradictory results when comparing Bayesian estimators to Expectation Maximization (EM) for unsupervised HMM POS tagging, and we show that the difference in reported results is largely due to differences in the size of the training data and the number of states in the HMM. We invesigate a variety of samplers for HMMs, including some that these earlier papers did not study. We find that all of Gibbs samplers do well with small data sets and few states, and that Variational Bayes does well on large data sets and is competitive with the Gibbs samplers. In terms of times of convergence, we find t...

Jianfeng Gao, Mark Johnson

Real-time Traffic

Bayesian Estimators | Data Sets | EMNLP 2008 | Large Data Sets | Natural Language Processing |

claim paper

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	EMNLP
Authors	Jianfeng Gao, Mark Johnson

Comments (0)

Sciweavers

A comparison of Bayesian estimators for unsupervised Hidden Markov Model POS taggers

Bayesian Estimators | Data Sets | EMNLP 2008 | Large Data Sets | Natural Language Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers