Sciweavers

13 search results - page 2 / 3
» Data Collection and IPR in Multilingual Parallel Corpora. Du...
ACL
2009
13 years 5 months ago
Extracting Paraphrases of Technical Terms from Noisy Parallel Software Corpora
In this paper, we study the problem of extracting technical paraphrases from a parallel software corpus, namely, a collection of duplicate bug reports. Paraphrase acquisition is a...
Xiaoyin Wang, David Lo, Jing Jiang, Lu Zhang, Hong...
IJCNLP
2005
Springer
14 years 28 days ago
Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi-Comparable Corpora
Abstract. We present a new implication of Wu’s (1997) Inversion Transduction Grammar (ITG) Hypothesis, on the problem of retrieving truly parallel sentence translations from larg...
Dekai Wu, Pascale Fung
LREC
2010
13 years 9 months ago
Enhanced Infrastructure for Creation and Collection of Translation Resources
Statistical Machine Translation (MT) systems have achieved impressive results in recent years, due in large part to the increasing availability of parallel text for system trainin...
Zhiyi Song, Stephanie Strassel, Gary Krug, Kazuaki...
LREC
2010
13 years 9 months ago
Construction of a Benchmark Data Set for Cross-lingual Word Sense Disambiguation
Given the recent trend to evaluate the performance of word sense disambiguation systems in a more application-oriented set-up, we report on the construction of a multilingual benc...
Els Lefever, Véronique Hoste
LREC
2008
13 years 8 months ago
Sentence Alignment in DPC: Maximizing Precision, Minimizing Human Effort
A wide spectrum of multilingual applications have aligned parallel corpora as their prerequisite. The aim of the project described in this paper is to build a multilingual corpus w...
Julia S. Trushkina, Lieve Macken, Hans Paulussen