Sciweavers

267 search results - page 13 / 54
» Statistical Machine Translation of Texts with Misspelled Wor...
Sort
View
LREC
2008
109views Education» more  LREC 2008»
13 years 10 months ago
Creating Sentence-Aligned Parallel Text Corpora from a Large Archive of Potential Parallel Text using BITS and Champollion
Parallel text is one of the most valuable resources for development of statistical machine translation systems and other NLP applications. The Linguistic Data Consortium (LDC) has...
Kazuaki Maeda, Xiaoyi Ma, Stephanie Strassel
ACL
2008
13 years 10 months ago
Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation
In statistical language modeling, one technique to reduce the problematic effects of data sparsity is to partition the vocabulary into equivalence classes. In this paper we invest...
Jakob Uszkoreit, Thorsten Brants
ICMLC
2010
Springer
13 years 6 months ago
An improvement of translation quality with adding key-words in parallel corpus
: In this paper, we propose a new approach to improve the translation quality by adding the Key-Words of a sentence to the parallel corpus. The main idea of the approach is to find...
Liang Tian, Fai Wong, Sam Chao
MT
2002
107views more  MT 2002»
13 years 8 months ago
Translation with Scarce Bilingual Resources
Machine translation of human languages is a field almost as old as computers themselves. Recent approaches to this challenging problem aim at learning translation knowledge automat...
Yaser Al-Onaizan, Ulrich Germann, Ulf Hermjakob, K...
NAACL
2003
13 years 10 months ago
Greedy Decoding for Statistical Machine Translation in Almost Linear Time
We present improvements to a greedy decoding algorithm for statistical machine translation that reduce its time complexity from at least cubic (  ¢¡¤£¦¥¨§ when applied na...
Ulrich Germann