Acquiring Translation Equivalences of Multiword Expressions by Normalized Correlation Frequencies

In this paper, we present an algorithm for extracting translations of any given multiword expression from parallel corpora. Given a multiword expression to be translated, the method involves extracting a short list of target candidate words from parallel corpora based on scores of normalized frequency, generating possible translations and filtering out common subsequences, and selecting the top-n possible translations using the Dice coefficient. Experiments show that our approach outperforms the word alignment-based and other naive association-based methods. We also demonstrate that adopting the extracted translations can significantly improve the performance of the Moses machine translation system.
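The final selection step mentioned in the abstract, ranking candidate translations by the Dice coefficient, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the normalized-frequency formula or the candidate-generation procedure, so the sketch assumes the candidates are already given and uses the standard definition Dice(s, t) = 2 f(s, t) / (f(s) + f(t)), where counts are numbers of aligned sentence pairs. The function names (`contains`, `dice`, `rank_candidates`) and the tiny English-French corpus are hypothetical and invented purely for illustration.

```python
from typing import List, Tuple

SentencePair = Tuple[List[str], List[str]]

# Hypothetical toy corpus (made-up data, not from the paper).
TOY_CORPUS: List[SentencePair] = [
    ("he will give up smoking".split(), "il va abandonner le tabac".split()),
    ("do not give up now".split(), "ne pas abandonner maintenant".split()),
    ("give me the book".split(), "donne moi le livre".split()),
    ("she gave up the plan".split(), "elle a abandonné le plan".split()),
]


def contains(tokens: List[str], phrase: List[str]) -> bool:
    """Return True if `phrase` occurs as a contiguous run inside `tokens`."""
    n = len(phrase)
    return any(tokens[i:i + n] == phrase for i in range(len(tokens) - n + 1))


def dice(corpus: List[SentencePair], source: List[str], target: List[str]) -> float:
    """Standard Dice coefficient over sentence-pair co-occurrence counts."""
    f_s = f_t = f_st = 0
    for src_tokens, tgt_tokens in corpus:
        in_src = contains(src_tokens, source)
        in_tgt = contains(tgt_tokens, target)
        f_s += in_src              # pairs whose source side has the expression
        f_t += in_tgt              # pairs whose target side has the candidate
        f_st += in_src and in_tgt  # pairs where both occur
    return 2 * f_st / (f_s + f_t) if (f_s + f_t) else 0.0


def rank_candidates(
    corpus: List[SentencePair],
    source: List[str],
    candidates: List[List[str]],
    n: int,
) -> List[Tuple[List[str], float]]:
    """Return the top-n candidates, highest Dice score first."""
    scored = [(cand, dice(corpus, source, cand)) for cand in candidates]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:n]


if __name__ == "__main__":
    candidates = [["le"], ["abandonner"], ["donne"]]
    for cand, score in rank_candidates(TOY_CORPUS, ["give", "up"], candidates, 2):
        print(" ".join(cand), round(score, 3))
```

In this toy corpus "give up" and "abandonner" occur in exactly the same two sentence pairs, so that candidate receives the highest possible score, while the frequent function word "le" is penalized because it also appears in pairs that do not contain the expression.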

Ming-Hong Bai, Jia-Ming You, Keh-Jiann Chen, Jason S. Chang

EMNLP 2009 | Multiword Expression | Natural Language Processing | Parallel Corpora | Possible Translations

Added	17 Feb 2011
Updated	17 Feb 2011
Type	Journal
Year	2009
Where	EMNLP
Authors	Ming-Hong Bai, Jia-Ming You, Keh-Jiann Chen, Jason S. Chang
