Sciweavers

Automatic Alignment in Parallel Corpora
LREC
2008
13 years 9 months ago
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion
Parallel text is one of the most valuable resources for development of statistical machine translation systems and other NLP applications. The Linguistic Data Consortium (LDC) has...
Kazuaki Maeda, Xiaoyi Ma, Stephanie Strassel
IAJIT
2011
13 years 2 months ago
Improving the accuracy of English-Arabic statistical sentence alignment
Multilingual natural language processing systems are increasingly relying on parallel corpus to ameliorate their output. Parallel corpora constitute the basic block for training ...
Mohammad Salameh, Rached Zantout, Nashat Mansour
NAACL
2004
13 years 9 months ago
What's in a translation rule?
We propose a theory that gives formal semantics to word-level alignments defined over parallel corpora. We use our theory to introduce a linear algorithm that can be used to deriv...
Michel Galley, Mark Hopkins, Kevin Knight, Daniel ...
ACTAC
2008
13 years 7 months ago
Sentence Alignment of Hungarian-English Parallel Corpora Using a Hybrid Algorithm
We present an efficient hybrid method for aligning sentences with their translations in a parallel bilingual corpus. The new algorithm is composed of a length-based and anchor matchi...
Krisztina Tóth, Richárd Farkas, Andr...
ECIR
2006
Springer
13 years 9 months ago
Automatic Acquisition of Chinese-English Parallel Corpus from the Web
Parallel corpora are a valuable resource for tasks such as cross-language information retrieval and data-driven natural language processing systems. Previously only small scale cor...
Ying Zhang, Ke Wu, Jianfeng Gao, Phil Vines