Multi-align: Combining Linguistic and Statistical Techniques to Improve Alignments for Adaptable MT



Abstract. An adaptable statistical or hybrid MT system relies heavily on the quality of word-level alignments of real-world data. Statistical alignment approaches provide a reasonable initial estimate for word alignment. However, they cannot handle certain types of linguistic phenomena such as long-distance dependencies and structural differences between languages. We address this issue in Multi-Align, a new framework for incremental testing of different alignment algorithms and their combinations. Our design allows users to tune their systems to the properties of a particular genre/domain while still benefiting from general linguistic knowledge associated with a language pair. We demonstrate that a combination of statistical and linguistically-informed alignments can resolve translation divergences during the alignment process.
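
To make the idea of combining alignments concrete, here is a minimal sketch. It is not the Multi-Align algorithm, which the abstract does not specify. It assumes each aligner outputs a set of (source index, target index) links and shows generic set-based combinations. The function name `combine_alignments` and the toy link sets are hypothetical.

```python
from typing import Dict, Set, Tuple

# A link pairs a source word index with a target word index (assumed representation).
Link = Tuple[int, int]


def combine_alignments(statistical: Set[Link], linguistic: Set[Link]) -> Dict[str, Set[Link]]:
    """Return simple set-based combinations of two word alignments.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    return {
        # Links both aligners agree on (typically higher precision).
        "intersection": statistical & linguistic,
        # Links proposed by either aligner (typically higher recall).
        "union": statistical | linguistic,
        # Links only the linguistically-informed aligner proposes.
        "linguistic_only": linguistic - statistical,
    }


# Toy data: invented links for a four-word sentence pair.
statistical = {(0, 0), (1, 1), (2, 3)}
linguistic = {(0, 0), (1, 1), (2, 2), (3, 3)}
combined = combine_alignments(statistical, linguistic)
```

In a setup like this, the intersection and union bracket the trade-off between precision and recall, and the `linguistic_only` links are the ones a purely statistical aligner would have missed.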

Necip Fazil Ayan, Bonnie J. Dorr, Nizar Habash


AMTA 2004 | Different Alignment Algorithms | Information Management | Statistical Alignment | Word Alignment


» Combining Morpheme-based Machine Translation with Post-processing Morpheme Prediction

» Boosting-based System Combination for Machine Translation

» Log-linear weight optimisation via Bayesian Adaptation in Statistical Machine Translation

Post Info

Added	30 Jun 2010
Updated	30 Jun 2010
Type	Conference
Year	2004
Where	AMTA
Authors	Necip Fazil Ayan, Bonnie J. Dorr, Nizar Habash
