COLING
2000

A Bootstrapping Method for Extracting Bilingual Text Pairs

This paper proposes a method for extracting bilingual text pairs from a comparable corpus. The basic idea of the method is to apply bootstrapping to an existing corpus-based cross-language information retrieval (CLIR) approach. We conducted preliminary tests with English and Japanese bilingual corpora. The bootstrapping method led to much better results for the task of extracting translation pairs compared with a corpus-based CLIR method without bootstrapping, and the extracted translation pairs could be useful training data for improving results of the corpus-based CLIR method.
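
The bootstrapping idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the CLIR model, so a simple term co-occurrence mapping with cosine similarity stands in for it here, and every name (`train_term_map`, `project`, `bootstrap`, `threshold`, `rounds`) and the toy data are assumptions made for illustration. Each round trains on the current pairs, extracts the confident new pairs, and adds them to the training data for the next round.

```python
import math
from collections import Counter, defaultdict


def train_term_map(pairs):
    """Count source/target term co-occurrences over aligned text pairs."""
    cooc = defaultdict(Counter)
    for src, tgt in pairs:
        for s in set(src.split()):
            for t in set(tgt.split()):
                cooc[s][t] += 1
    return cooc


def project(src, cooc):
    """Map a source text to a target-language term vector (stand-in CLIR step)."""
    vec = Counter()
    for s in src.split():
        if s not in cooc:
            continue  # term never seen in training pairs
        total = sum(cooc[s].values())
        for t, c in cooc[s].items():
            vec[t] += c / total
    return vec


def cosine(a, b):
    dot = sum(v * b.get(k, 0) for k, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def bootstrap(seed_pairs, src_docs, tgt_docs, rounds=3, threshold=0.55):
    """Repeatedly retrain on seed + extracted pairs and extract new pairs."""
    training = list(seed_pairs)
    remaining_src, remaining_tgt = list(src_docs), list(tgt_docs)
    extracted = []
    for _ in range(rounds):
        cooc = train_term_map(training)
        scored = []
        for s in remaining_src:
            sv = project(s, cooc)
            score, best = max(
                ((cosine(sv, Counter(t.split())), t) for t in remaining_tgt),
                default=(0.0, None),
            )
            if best is not None and score >= threshold:
                scored.append((score, s, best))
        if not scored:
            break  # nothing confident enough; stop early
        scored.sort(reverse=True)
        used_tgt = set()
        for _score, s, t in scored:
            if t in used_tgt:
                continue  # each target text is paired at most once per round
            used_tgt.add(t)
            extracted.append((s, t))
            training.append((s, t))  # extracted pair becomes training data
            remaining_src.remove(s)
            remaining_tgt.remove(t)
    return extracted


# Toy data (hypothetical, not from the paper)
seed = [("cat dog", "neko inu"), ("dog bird", "inu tori")]
src = ["cat bird", "fish cat"]
tgt = ["neko tori", "sakana neko"]
print(bootstrap(seed, src, tgt, rounds=1))
print(bootstrap(seed, src, tgt, rounds=3))
```

In this toy run, a single round extracts only the first pair, while a further round also recovers the second pair, because the newly added pair sharpens the term mapping. That mirrors, in miniature, the abstract's claim that extracted pairs can serve as training data.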
Hiroshi Masuichi, Raymond Flournoy, Stefan Kaufmann, Stanley Peters
Type Conference
Year 2000
Where COLING
Authors Hiroshi Masuichi, Raymond Flournoy, Stefan Kaufmann, Stanley Peters