A Bootstrapping Method for Extracting Bilingual Text Pairs

This paper proposes a method for extracting bilingual text pairs from a comparable corpus. The basic idea of the method is to apply bootstrapping to an existing corpus-based cross-language information retrieval (CLIR) approach. We conducted preliminary tests with English and Japanese bilingual corpora. The bootstrapping method led to much better results for the task of extracting translation pairs than a corpus-based CLIR method without bootstrapping, and the extracted translation pairs could be useful training data for improving the results of the corpus-based CLIR method.
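To make the bootstrapping idea concrete, here is a minimal sketch in Python. It is not the authors' system: the abstract does not specify the underlying CLIR model, so the sketch assumes a simple word co-occurrence table with cosine similarity as a stand-in, and every name in it (`train_cooccurrence`, `project`, `cosine`, `bootstrap`, the `threshold` parameter, the toy romanized sentences) is hypothetical. It only illustrates the loop of training on known pairs, retrieving confident new pairs, and retraining on the enlarged set.

```python
import math
from collections import Counter


def train_cooccurrence(pairs):
    """Count how often each source word co-occurs with each target word
    across the aligned text pairs (an assumed stand-in for a CLIR model)."""
    table = {}
    for src, tgt in pairs:
        for s in src.split():
            table.setdefault(s, Counter()).update(tgt.split())
    return table


def project(src, table):
    """Map a source text into target-vocabulary space via the table."""
    vec = Counter()
    for s in src.split():
        for t, count in table.get(s, {}).items():
            vec[t] += count
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse bag-of-words vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def bootstrap(seed_pairs, src_docs, tgt_docs, rounds=3, threshold=0.4):
    """Repeatedly retrain on seed plus extracted pairs; return extracted pairs."""
    pairs = list(seed_pairs)
    src_pool, tgt_pool = list(src_docs), list(tgt_docs)
    for _ in range(rounds):
        table = train_cooccurrence(pairs)
        accepted = []
        for s in list(src_pool):
            if not tgt_pool:
                break
            query = project(s, table)
            score, best = max(
                (cosine(query, Counter(t.split())), t) for t in tgt_pool
            )
            if score >= threshold:
                # Confident match: take both texts out of the unpaired pools.
                accepted.append((s, best))
                src_pool.remove(s)
                tgt_pool.remove(best)
        if not accepted:
            break  # nothing new was learned, so further rounds cannot help
        pairs.extend(accepted)
    return pairs[len(seed_pairs):]


# Toy data (hypothetical): one seed pair and two unpaired texts per language.
seed = [("dog runs", "inu hashiru")]
src = ["dog eats", "cat eats"]
tgt = ["inu taberu", "neko taberu"]

one_pass = bootstrap(seed, src, tgt, rounds=1)
bootstrapped = bootstrap(seed, src, tgt, rounds=3)
```

On this toy data a single pass pairs only "dog eats" with "inu taberu", because "eats" is unknown to the seed model. After retraining on that new pair, the second round can also pair "cat eats" with "neko taberu", which is the effect bootstrapping is meant to produce.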

Hiroshi Masuichi, Raymond Flournoy, Stefan Kaufmann, Stanley Peters

Bilingual Text Pairs | Bootstrapping Method | COLING 2000 | COLING 2008 | Corpus-based CLIR Method

» Automatically Creating Bilingual Lexicons for Machine Translation from Bilingual Text

» Mining Bilingual Data from the Web with Adaptively Learnt Patterns

» Automatic extraction of bilingual word pairs using inductive chain learning in various lan...

» Automatically Harvesting Katakana-English Term Pairs from Search Engine Query Logs

» Learning Method for Automatic Acquisition of Translation Knowledge

» Learning Translation Templates From Bilingual Text

» Semi-Supervised Learning of Partial Cognates Using Bilingual Bootstrapping

» Automatic Extraction of Translational Japanese-KATAKANA and English Word Pairs

Post Info

Added	01 Nov 2010
Updated	01 Nov 2010
Type	Conference
Year	2000
Where	COLING
Authors	Hiroshi Masuichi, Raymond Flournoy, Stefan Kaufmann, Stanley Peters
