parallel text | Sciweavers

COLING
2010

Large Scale Parallel Document Mining for Machine Translation

15 years 1 month ago

A distributed system is described that reliably mines parallel text from large corpora. The approach can be regarded as cross-language near-duplicate detection, enabled by an init...

Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe ...

ACL
2009

Active Learning for Multilingual Statistical Machine Translation

15 years 4 months ago

Statistical machine translation (SMT) models require bilingual corpora for training, and these corpora are often multilingual with parallel text in multiple languages simultaneous...

Gholamreza Haffari, Anoop Sarkar

NLE
2007

Segmentation and alignment of parallel text for statistical machine translation

15 years 6 months ago

We address the problem of extracting bilingual chunk pairs from parallel text to create training sets for statistical machine translation. We formulate the problem in terms of a s...

Yonggang Deng, Shankar Kumar, William Byrne

LREC
2008

Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion

15 years 8 months ago

Parallel text is one of the most valuable resources for development of statistical machine translation systems and other NLP applications. The Linguistic Data Consortium (LDC) has...

Kazuaki Maeda, Xiaoyi Ma, Stephanie Strassel

LREC
2010

Enhanced Infrastructure for Creation and Collection of Translation Resources

15 years 8 months ago

Statistical Machine Translation (MT) systems have achieved impressive results in recent years, due in large part to the increasing availability of parallel text for system trainin...

Zhiyi Song, Stephanie Strassel, Gary Krug, Kazuaki...
