A Maximum Entropy Tagger with Unsupervised Hidden Markov Models

14 years 5 months ago

Download mastarpj.nict.go.jp

We describe a new tagging model where the states of a hidden Markov model (HMM) estimated by unsupervised learning are incorporated as the features in a maximum entropy model. Our method for exploiting unsupervised learning of a probabilistic model can reduce the cost of building taggers with no dictionary and a small annotated corpus. Experimental results on English POS tagging and Japanese word segmentation show that in both tasks our method greatly improves the tagging accuracy when the model is trained with a small annotated corpus. Furthermore, our English POS tagger achieved betterthan-state-of-the-art POS tagging accuracy (96.84%) when a large annotated corpus is available.

Jun'ichi Kazama, Yusuke Miyao, Jun-ichi Tsujii

Real-time Traffic

Natural Language Processing | NLPRS 2001 | POS Tagging Accuracy | Small Annotated Corpus | Unsupervised Learning |

claim paper

Post Info
More Details (n/a)

Added	30 Jul 2010
Updated	30 Jul 2010
Type	Conference
Year	2001
Where	NLPRS
Authors	Jun'ichi Kazama, Yusuke Miyao, Jun-ichi Tsujii

Comments (0)

Sciweavers

A Maximum Entropy Tagger with Unsupervised Hidden Markov Models

Natural Language Processing | NLPRS 2001 | POS Tagging Accuracy | Small Annotated Corpus | Unsupervised Learning |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers