Sciweavers

NAACL
2004
14 years 24 days ago
What's in a translation rule?
We propose a theory that gives formal semantics to word-level alignments defined over parallel corpora. We use our theory to introduce a linear algorithm that can be used to deriv...
Michel Galley, Mark Hopkins, Kevin Knight, Daniel ...
EACL
2003
ACL Anthology
14 years 25 days ago
Empirical Methods for Compound Splitting
Compounded words are a challenge for NLP applications such as machine translation (MT). We introduce methods to learn splitting rules from monolingual and parallel corpora. We eva...
Philipp Koehn, Kevin Knight
ECIR
2006
Springer
14 years 25 days ago
Automatic Acquisition of Chinese-English Parallel Corpus from the Web
Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...
Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines
LREC
2008
14 years 26 days ago
Experiments on Processing Overlapping Parallel Corpora
The number and sizes of parallel corpora keep growing, which makes it necessary to have automatic methods of processing them: combining, checking and improving corpora quality, et...
Mark Fishel, Heiki Jaan Kaalep
LREC
2010
14 years 27 days ago
Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content
Parallel corpora are indispensable resources for a variety of multilingual natural language processing tasks. This paper presents a technique for fully automatic construction of c...
Yulia Tsvetkov, Shuly Wintner
LREC
2010
14 years 27 days ago
Active Learning and Crowd-Sourcing for Machine Translation
In recent years, corpus based approaches to machine translation have become predominant, with Statistical Machine Translation (SMT) being the most actively progressing area. Succe...
Vamshi Ambati, Stephan Vogel, Jaime G. Carbonell
ACL
2007
14 years 27 days ago
Assisting Translators in Indirect Lexical Transfer
We present the design and evaluation of a translator’s amanuensis that uses comparable corpora to propose and rank nonliteral solutions to the translation of expressions from th...
Bogdan Babych, Anthony Hartley, Serge Sharoff, Olg...
AMTA
1998
Springer
14 years 3 months ago
Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text
Parallel corpora are a valuable resource for machine translation, but at present their availability and utility is limited by genre- and domain-specificity, licensing restri...
Philip Resnik
KES
2005
Springer
14 years 5 months ago
Learning Method for Automatic Acquisition of Translation Knowledge
This paper presents a new learning method for automatic acquisition of translation knowledge from parallel corpora. We apply this learning method to automatic extraction of bilingu...
Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi
TSD
2007
Springer
14 years 5 months ago
Using Query-Relevant Documents Pairs for Cross-Lingual Information Retrieval
The world wide web is a natural setting for cross-lingual information retrieval. The European Union is a typical example of a multilingual scenario, where multiple users have to de...
David Pinto, Alfons Juan, Paolo Rosso