Sciweavers

EMNLP
2009

Acquiring Translation Equivalences of Multiword Expressions by Normalized Correlation Frequencies

In this paper, we present an algorithm for extracting translations of any given multiword expression from parallel corpora. Given a multiword expression to be translated, the method involves extracting a short list of target candidate words from parallel corpora based on scores of normalized frequency, generating possible translations and filtering out common subsequences, and selecting the top-n possible translations using the Dice coefficient. Experiments show that our approach outperforms the word alignment-based and other naive association-based methods. We also demonstrate that adopting the extracted translations can significantly improve the performance of the Moses machine translation system.
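The final step of the abstract, ranking candidate translations with the Dice coefficient, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give their exact formulation, so the sketch assumes the standard sentence-level form, Dice(x, y) = 2 f(x, y) / (f(x) + f(y)), where f counts the sentence pairs containing the phrase. The names `contains`, `dice`, `top_n`, and `TOY_CORPUS`, as well as the placeholder target tokens `x1`…`x6`, are hypothetical and exist only for illustration.

```python
def contains(tokens, phrase):
    """Return True if `phrase` occurs as a contiguous run inside `tokens`."""
    n = len(phrase)
    return any(tokens[i:i + n] == phrase for i in range(len(tokens) - n + 1))


def dice(src_phrase, tgt_phrase, corpus):
    """Sentence-level Dice coefficient between a source and a target phrase.

    Assumption: frequencies are counted per aligned sentence pair, which is
    the textbook definition and not necessarily the paper's exact variant.
    """
    f_src = f_tgt = f_both = 0
    for src_tokens, tgt_tokens in corpus:
        in_src = contains(src_tokens, src_phrase)
        in_tgt = contains(tgt_tokens, tgt_phrase)
        f_src += in_src
        f_tgt += in_tgt
        f_both += in_src and in_tgt
    if f_src + f_tgt == 0:
        return 0.0
    return 2.0 * f_both / (f_src + f_tgt)


def top_n(src_phrase, candidates, corpus, n=3):
    """Rank candidate target phrases by Dice score and keep the best n."""
    scored = [(dice(src_phrase, cand, corpus), cand) for cand in candidates]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:n]


# Hypothetical toy parallel corpus: (source tokens, target tokens).
# The target side uses placeholder tokens, not a real language.
TOY_CORPUS = [
    (["they", "give", "up", "early"], ["x1", "x2", "x3"]),
    (["never", "give", "up"], ["x4", "x2", "x3"]),
    (["give", "it", "up"], ["x2", "x5"]),
    (["we", "stand", "up"], ["x6", "x3"]),
]

if __name__ == "__main__":
    candidates = [["x2"], ["x3"], ["x2", "x3"], ["x5"]]
    for score, cand in top_n(["give", "up"], candidates, TOY_CORPUS):
        print(f"{' '.join(cand)}\t{score:.2f}")
```

In this toy data the two-token candidate `x2 x3` co-occurs with "give up" in every sentence pair where either appears, so it ranks above the single-token candidates that also appear elsewhere. The candidate-generation and common-subsequence filtering steps mentioned in the abstract are not covered by this sketch.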
Added 17 Feb 2011
Updated 17 Feb 2011
Type Journal
Year 2009
Where EMNLP
Authors Ming-Hong Bai, Jia-Ming You, Keh-Jiann Chen, Jason S. Chang