Automatic extraction of bilingual terms from a Chinese-Japanese parallel corpus

16 years 2 months ago

Download www.r.dl.itc.u-tokyo.ac.jp

This paper proposes a new approach for the automatic extraction of bilingual terms from a domain-specific bilingual parallel corpus. We combine existing monolingual term extractor and a word alignment tool to extract bilingual terms. Our method is different from those past studies as we simply use a word alignment tool to extract multi-words terms, and we use one monolingual term extractor for both of languages to reduce extraction imbalance. We obtained a good precision and an improved BLEU score in our experiment based on a Chinese-Japanese parallel corpus. Categories and Subject Descriptors D.3.3 [Artificial Intelligence]: Natural Language Processing – machine translation General Terms Languages Keywords Automatic extraction, bilingual corpus, bilingual term, word alignment, segmentation, multi-words term

Xiaorong Fan, Nobuyuki Shimizu, Hiroshi Nakagawa

Real-time Traffic

Bilingual Term | Communications | IUCS 2009 | Monolingual Term Extractor | Word Alignment |

claim paper

Post Info
More Details (n/a)

Added	28 May 2010
Updated	28 May 2010
Type	Conference
Year	2009
Where	IUCS
Authors	Xiaorong Fan, Nobuyuki Shimizu, Hiroshi Nakagawa

Comments (0)

Sciweavers

Automatic extraction of bilingual terms from a Chinese-Japanese parallel corpus

Bilingual Term | Communications | IUCS 2009 | Monolingual Term Extractor | Word Alignment |

Explore & Download

Productivity Tools

Sciweavers