Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation

Download www.aclweb.org

In this paper we investigate the challenges of applying statistical machine translation to meeting conversations, with a particular view towards analyzing the importance of modeling contextual factors such as the larger discourse context and topic/domain information on translation performance. We describe the collection of a small corpus of parallel meeting data, the development of a statistical machine translation system in the absence of genre-matched training data, and we present a quantitative analysis of translation errors resulting from the lack of contextual modeling inherent in standard statistical machine translation systems. Finally, we demonstrate how the largest source of translation errors (lack of topic/domain knowledge) can be addressed by applying document-level, unsupervised word sense disambiguation, resulting in performance improvements over the baseline system.

Mei Yang, Katrin Kirchhoff

COLING 2010 | Computational Linguistics | Machine Translation Systems | Statistical Machine Translation | Translation Errors |

Post Info

Added	13 May 2011
Updated	13 May 2011
Type	Journal
Year	2010
Where	COLING
Authors	Mei Yang, Katrin Kirchhoff
