HM-BiTAM: Bilingual Topic Exploration, Word Alignment, and Translation

14 years 2 months ago

Download books.nips.cc

We present a novel paradigm for statistical machine translation (SMT), based on a joint modeling of word alignment and the topical aspects underlying bilingual document-pairs, via a hidden Markov Bilingual Topic AdMixture (HM-BiTAM). In this paradigm, parallel sentence-pairs from a parallel document-pair are coupled via a certain semantic-ﬂow, to ensure coherence of topical context in the alignment of mapping words between languages, likelihood-based training of topic-dependent translational lexicons, as well as in the inference of topic representations in each language. The learned HM-BiTAM can not only display topic patterns like methods such as LDA [1], but now for bilingual corpora; it also offers a principled way of inferring optimal translation using document context. Our method integrates the conventional model of HMM — a key component for most of the state-of-the-art SMT systems, with the recently proposed BiTAM model [10]; we report an extensive empirical analysis (in man...

Bing Zhao, Eric P. Xing

Real-time Traffic

Bilingual Topic | Bilingual Topic AdMixture | Information Technology | NIPS 2007 | Word Alignment |

claim paper

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2007
Where	NIPS
Authors	Bing Zhao, Eric P. Xing

Comments (0)

Sciweavers

HM-BiTAM: Bilingual Topic Exploration, Word Alignment, and Translation

Bilingual Topic | Bilingual Topic AdMixture | Information Technology | NIPS 2007 | Word Alignment |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers