Sciweavers

81 search results - page 6 / 17
» Using cognates to align sentences in bilingual corpora
NAACL
2010
13 years 5 months ago
Extracting Parallel Sentences from Comparable Corpora using Document Level Alignment
The quality of a statistical machine translation (SMT) system is heavily dependent upon the amount of parallel sentences used in training. In recent years, there have been several...
Jason R. Smith, Chris Quirk, Kristina Toutanova
IJCNLP
2005
Springer
14 years 1 months ago
French-English Terminology Extraction from Comparable Corpora
This article presents a method of extracting bilingual lexica composed of single-word terms (SWTs) and multi-word terms (MWTs) from comparable corpora of a technical domain. First,...
Béatrice Daille, Emmanuel Morin
NAACL
2003
13 years 9 months ago
Word Sense Acquisition from Bilingual Comparable Corpora
Manually constructing an inventory of word senses has suffered from problems including high cost, arbitrary assignment of meaning to words, and mismatch to domains. To overcome th...
Hiroyuki Kaji
EACL
2003
ACL Anthology
13 years 9 months ago
Automatic Construction of Machine Translation Knowledge Using Translation Literalness
When machine translation (MT) knowledge is automatically constructed from bilingual corpora, redundant rules are acquired due to translation variety. These rules increase ambiguit...
Kenji Imamura, Eiichiro Sumita, Yuji Matsumoto
LREC
2008
13 years 9 months ago
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion
Parallel text is one of the most valuable resources for development of statistical machine translation systems and other NLP applications. The Linguistic Data Consortium (LDC) has...
Kazuaki Maeda, Xiaoyi Ma, Stephanie Strassel