Search Sciweavers | Sciweavers

41 search results - page 1 / 9

» Large Scale Parallel Document Mining for Machine Translation

164

click to vote

COLING
2010

108views Computational Linguistics» more COLING 2010»

Large Scale Parallel Document Mining for Machine Translation

15 years 1 months ago

Download static.googleusercontent.com

A distributed system is described that reliably mines parallel text from large corpora. The approach can be regarded as cross-language near-duplicate detection, enabled by an init...

Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe ...

claim paper

Read More »

183

click to vote

ACL
2008

160views Computational Linguistics» more ACL 2008»

Mining Parenthetical Translations from the Web by Word Alignment

15 years 8 months ago

Download www.aclweb.org

Documents in languages such as Chinese, Japanese and Korean sometimes annotate terms with their translations in English inside a pair of parentheses. We present a method to extrac...

Dekang Lin, Shaojun Zhao, Benjamin Van Durme, Mari...

claim paper

Read More »

170

click to vote

ACL
2011

190views Computational Linguistics» more ACL 2011»

A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation

14 years 10 months ago

Download www.cs.wright.edu

This paper presents an attempt at building a large scale distributed composite language model that simultaneously accounts for local word lexical information, mid-range sentence s...

Ming Tan, Wenli Zhou, Lei Zheng, Shaojun Wang

claim paper

Read More »

157

click to vote

LREC
2008

109views Education» more LREC 2008»

Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion

15 years 8 months ago

Download www.lrec-conf.org

Parallel text is one of the most valuable resources for development of statistical machine translation systems and other NLP applications. The Linguistic Data Consortium (LDC) has...

Kazuaki Maeda, Xiaoyi Ma, Stephanie Strassel

claim paper

Read More »

156

click to vote

AMTA
1998
Springer

103views Information Technology» more AMTA 1998»

Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text

15 years 11 months ago

Download www.lib.umd.edu

Abstract. Parallel corpora are a valuable resource for machine translation, but at present their availability and utility is limited by genreand domain-speci city, licensing restri...

Philip Resnik

claim paper

Read More »

« Prev « First page 1 / 9 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers