Extraction of Lexical Translations from Non-Aligned Corpora

14 years 1 months ago

Download acl.ldc.upenn.edu

A method for extracting lexical translations from non-aligned corpora is proposed to cope with the unavailability of large aligned corpus. The assumption that "translations of two co-occurring words in a source language also co-occur in the target language" is adopted and represented in the stochastic matrix formulation. The translation matrix provides the co-occurring information translated from the source into the target. This translated co-occurring information should resemble that of the original in the target when the ambiguity of the translational relation is resolved. An algorithm to obtain the best translation matrix is introduced. Some experiments were performed to evaluate the effectiveness of the ambiguity resolution and the refinement of the dictionary.

Kumiko Tanaka, Hideya Iwasaki

Real-time Traffic

Co-occurring Information | COLING 1996 | COLING 2008 | Stochastic Matrix Formulation | Translation Matrix |

claim paper

Post Info
More Details (n/a)

Added	02 Nov 2010
Updated	02 Nov 2010
Type	Conference
Year	1996
Where	COLING
Authors	Kumiko Tanaka, Hideya Iwasaki

Comments (0)

Sciweavers

Extraction of Lexical Translations from Non-Aligned Corpora

Co-occurring Information | COLING 1996 | COLING 2008 | Stochastic Matrix Formulation | Translation Matrix |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers