Sciweavers

COLING
2010

Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation

13 years 7 months ago
Contextual Modeling for Meeting Translation Using Unsupervised Word Sense Disambiguation
In this paper we investigate the challenges of applying statistical machine translation to meeting conversations, with a particular view towards analyzing the importance of modeling contextual factors such as the larger discourse context and topic/domain information on translation performance. We describe the collection of a small corpus of parallel meeting data, the development of a statistical machine translation system in the absence of genre-matched training data, and we present a quantitative analysis of translation errors resulting from the lack of contextual modeling inherent in standard statistical machine translation systems. Finally, we demonstrate how the largest source of translation errors (lack of topic/domain knowledge) can be addressed by applying documentlevel, unsupervised word sense disambiguation, resulting in performance improvements over the baseline system.
Mei Yang, Katrin Kirchhoff
Added 13 May 2011
Updated 13 May 2011
Type Journal
Year 2010
Where COLING
Authors Mei Yang, Katrin Kirchhoff
Comments (0)