Parallel Corpora | Sciweavers

NAACL
2004

15 years 8 months ago

We propose a theory that gives formal semantics to word-level alignments defined over parallel corpora. We use our theory to introduce a linear algorithm that can be used to deriv...

Michel Galley, Mark Hopkins, Kevin Knight, Daniel ...

EACL
2003
ACL Anthology

Empirical Methods for Compound Splitting

15 years 8 months ago

Download www.iccs.informatics.ed.ac.uk

Compounded words are a challenge for NLP applications such as machine translation (MT). We introduce methods to learn splitting rules from monolingual and parallel corpora. We eva...

Philipp Koehn, Kevin Knight

ECIR
2006
Springer

Automatic Acquisition of Chinese-English Parallel Corpus from the Web

15 years 8 months ago

Download research.microsoft.com

Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...

Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines

LREC
2008

Experiments on Processing Overlapping Parallel Corpora

15 years 8 months ago

Download www.lrec-conf.org

The number and sizes of parallel corpora keep growing, which makes it necessary to have automatic methods of processing them: combining, checking and improving corpora quality, et...

Mark Fishel, Heiki Jaan Kaalep

LREC
2010

Automatic Acquisition of Parallel Corpora from Websites with Dynamic Content

15 years 8 months ago

Download cs.haifa.ac.il

Parallel corpora are indispensable resources for a variety of multilingual natural language processing tasks. This paper presents a technique for fully automatic construction of c...

Yulia Tsvetkov, Shuly Wintner

LREC
2010

Active Learning and Crowd-Sourcing for Machine Translation

15 years 8 months ago

Download www.cs.cmu.edu

In recent years, corpus-based approaches to machine translation have become predominant, with Statistical Machine Translation (SMT) being the most actively progressing area. Succe...

Vamshi Ambati, Stephan Vogel, Jaime G. Carbonell

ACL
2007

Assisting Translators in Indirect Lexical Transfer

15 years 8 months ago

Download corpus.leeds.ac.uk

We present the design and evaluation of a translator’s amanuensis that uses comparable corpora to propose and rank nonliteral solutions to the translation of expressions from th...

Bogdan Babych, Anthony Hartley, Serge Sharoff, Olg...

AMTA
1998
Springer

Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text

15 years 11 months ago

Download www.lib.umd.edu

Parallel corpora are a valuable resource for machine translation, but at present their availability and utility is limited by genre- and domain-specificity, licensing restri...

Philip Resnik

KES
2005
Springer

Learning Method for Automatic Acquisition of Translation Knowledge

16 years 4 days ago

Download sig.media.eng.hokudai.ac.jp

This paper presents a new learning method for automatic acquisition of translation knowledge from parallel corpora. We apply this learning method to automatic extraction of bilingu...

Hiroshi Echizen-ya, Kenji Araki, Yoshio Momouchi

TSD
2007
Springer

Using Query-Relevant Documents Pairs for Cross-Lingual Information Retrieval

16 years 23 days ago

Download users.dsic.upv.es

The world wide web is a natural setting for cross-lingual information retrieval. The European Union is a typical example of a multilingual scenario, where multiple users have to de...

David Pinto, Alfons Juan, Paolo Rosso
