Search Sciweavers | Sciweavers

51 search results - page 8 / 11

» Automatic Filtering of Bilingual Corpora for Statistical Mac...

click to vote

LREC
2008

114views Education» more LREC 2008»

Improving Statistical Machine Translation Efficiency by Triangulation

13 years 9 months ago

Download www.lrec-conf.org

In current phrase-based Statistical Machine Translation systems, more training data is generally better than less. However, a larger data set eventually introduces a larger model ...

Yu Chen, Andreas Eisele, Martin Kay

claim paper

Read More »

click to vote

ACL
2010

178views Computational Linguistics» more ACL 2010»

Pseudo-Word for Phrase-Based Machine Translation

13 years 5 months ago

Download www.aclweb.org

The pipeline of most Phrase-Based Statistical Machine Translation (PB-SMT) systems starts from automatically word aligned parallel corpus. But word appears to be too fine-grained ...

Xiangyu Duan, Min Zhang, Haizhou Li

claim paper

Read More »

click to vote

ACL
2008

168views Computational Linguistics» more ACL 2008»

Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation

13 years 9 months ago

Download www.aclweb.org

In statistical language modeling, one technique to reduce the problematic effects of data sparsity is to partition the vocabulary into equivalence classes. In this paper we invest...

Jakob Uszkoreit, Thorsten Brants

claim paper

Read More »

click to vote

COLING
2010

137views Computational Linguistics» more COLING 2010»

An Empirical Study on Web Mining of Parallel Data

13 years 2 months ago

Download www.aclweb.org

This paper1 presents an empirical approach to mining parallel corpora. Conventional approaches use a readily available collection of comparable, nonparallel corpora to extract par...

Gum-Won Hong, Chi-Ho Li, Ming Zhou, Hae-Chang Rim

claim paper

Read More »

click to vote

ECIR
2010
Springer

151views Information Technology» more ECIR 2010»

Estimating Translation Probabilities from the Web for Structured Queries on CLIR

13 years 9 months ago

Download www.elhuyar.org

We present two methods for estimating replacement probabilities without using parallel corpora. The first method proposed exploits the possible translation probabilities latent in ...

Xabier Saralegi, Maddalen Lopez de Lacalle

claim paper

Read More »

« Prev « First page 8 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers