IPM
2006

Automatic extraction of bilingual word pairs using inductive chain learning in various languages

In this paper, we propose a new learning method for extracting bilingual word pairs from parallel corpora in various languages. A cross-language information retrieval system must handle many languages, so automatic extraction of bilingual word pairs from parallel corpora in those languages is important. However, previous work based on statistical methods is insufficient because of the sparse data problem. Our learning method automatically acquires rules that are effective in solving the sparse data problem, using only parallel corpora and no previously prepared bilingual resource (e.g., a bilingual dictionary or a machine translation system). We call this learning method Inductive Chain Learning (ICL). Moreover, a system using ICL can extract bilingual word pairs even from bilingual sentence pairs in which the grammatical structure of the source language differs from that of the target language because the acquired rules have the...
Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi
Added 13 Dec 2010
Updated 13 Dec 2010
Type Journal
Year 2006
Where IPM
Authors Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi