An Unsupervised Model for Joint Phrase Alignment and Extraction

13 years 11 months ago

Download www.phontron.com

We present an unsupervised model for joint phrase alignment and extraction using nonparametric Bayesian methods and inversion transduction grammars (ITGs). The key contribution is that phrases of many granularities are included directly in the model through the use of a novel formulation that memorizes phrases generated not only by terminal, but also non-terminal symbols. This allows for a completely probabilistic model that is able to create a phrase table that achieves competitive accuracy on phrase-based machine translation tasks directly from unaligned sentence pairs. Experiments on several language pairs demonstrate that the proposed model matches the accuracy of traditional two-step word alignment/phrase extraction approach while reducing the phrase table to a fraction of the original size.

Graham Neubig, Taro Watanabe, Eiichiro Sumita, Shi

Real-time Traffic

ACL 2011 | Computational Linguistics | Nonparametric Bayesian Methods | Sentence Pairs | Word Alignment |

claim paper

Post Info
More Details (n/a)

Added	23 Aug 2011
Updated	23 Aug 2011
Type	Journal
Year	2011
Where	ACL
Authors	Graham Neubig, Taro Watanabe, Eiichiro Sumita, Shinsuke Mori, Tatsuya Kawahara

Comments (0)

Sciweavers

An Unsupervised Model for Joint Phrase Alignment and Extraction

ACL 2011 | Computational Linguistics | Nonparametric Bayesian Methods | Sentence Pairs | Word Alignment |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers