Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

228

ACL
2011

211views Computational Linguistics» more ACL 2011»

Rare Word Translation Extraction from Aligned Comparable Documents

14 years 10 months ago

Rare Word Translation Extraction from Aligned Comparable Documents

Download eprochasson.free.fr

We present a ﬁrst known result of high precision rare word bilingual extraction from comparable corpora, using aligned comparable documents and supervised classiﬁcation. We incorporate two features, a context-vector similarity and a co-occurrence model between words in aligned documents in a machine learning approach. We test our hypothesis on different pairs of languages and corpora. We obtain very high F-Measure between 80% and 98% for recognizing and extracting correct translations for rare terms (from 1 to 5 occurrences). Moreover, we show that our system can be trained on a pair of languages and test on a different pair of languages, obtaining a F-Measure of 77% for the classiﬁcation of Chinese-English translations using a training corpus of Spanish-French. Our method is therefore even applicable to low languages without training data.

Emmanuel Prochasson, Pascale Fung

Real-time Traffic

ACL 2011 | Comparable Corpora | Computational Linguistics | Rare Terms | Rare Word |

claim paper

Related Content

» Mining Parenthetical Translations from the Web by Word Alignment

» Finding translations for lowfrequency words in comparable corpora

» Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment

» Extracting Multilingual Topics from Unaligned Comparable Corpora

» ACCURAT Toolkit for MultiLevel Alignment and Information Extraction from Comparable Corpor...

» Phrase Translation Extraction from Aligned Parallel Corpora Using Suffix Arrays and Relate...

» HMM Word and Phrase Alignment for Statistical Machine Translation

» Clickthroughbased translation models for web search from word models to phrase models

» Collocation Extraction Using Monolingual Word Alignment Method

Post Info
More Details (n/a)

Added	23 Aug 2011
Updated	23 Aug 2011
Type	Journal
Year	2011
Where	ACL
Authors	Emmanuel Prochasson, Pascale Fung

Comments (0)