Sciweavers

COLING
2010

Syntax Based Reordering with Automatically Derived Rules for Improved Statistical Machine Translation

13 years 6 months ago
Syntax Based Reordering with Automatically Derived Rules for Improved Statistical Machine Translation
Syntax based reordering has been shown to be an effective way of handling word order differences between source and target languages in Statistical Machine Translation (SMT) systems. We present a simple, automatic method to learn rules that reorder source sentences to more closely match the target language word order using only a source side parse tree and automatically generated alignments. The resulting rules are applied to source language inputs as a pre-processing step and demonstrate significant improvements in SMT systems across a variety of languages pairs including English to Hindi, English to Spanish and English to French as measured on a variety of internal test sets as well as a public test set.
Karthik Visweswariah, Jiri Navratil, Jeffrey S. So
Added 13 May 2011
Updated 13 May 2011
Type Journal
Year 2010
Where COLING
Authors Karthik Visweswariah, Jiri Navratil, Jeffrey S. Sorensen, Vijil Chenthamarakshan, Nandakishore Kambhatla
Comments (0)