Sciweavers

76 search results - page 5 / 16
» Statistical Machine Translation with Word- and Sentence-Alig...
Sort
View
ACL
2008
13 years 10 months ago
Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation
In statistical language modeling, one technique to reduce the problematic effects of data sparsity is to partition the vocabulary into equivalence classes. In this paper we invest...
Jakob Uszkoreit, Thorsten Brants
ICCPOL
2009
Springer
14 years 1 months ago
Constructing Parallel Corpus from Movie Subtitles
Abstract. This paper describes a methodology for constructing aligned German-Chinese corpora from movie subtitles. The corpora will be used to train a special machine translation s...
Han Xiao, Xiaojie Wang
COLING
1992
13 years 9 months ago
Learning Translation Templates From Bilingual Text
This paper proposes a two-phase example-based machine translation methodology which develops translation templates from examples and then translates using template matching. This ...
Hiroyuki Kaji, Yuuko Kida, Yasutsugu Morimoto
EMNLP
2007
13 years 10 months ago
Improving Word Alignment with Bridge Languages
We describe an approach to improve Statistical Machine Translation (SMT) performance using multi-lingual, parallel, sentence-aligned corpora in several bridge languages. Our appro...
Shankar Kumar, Franz Josef Och, Wolfgang Macherey
ACL
2003
13 years 10 months ago
A Probability Model to Improve Word Alignment
Word alignment plays a crucial role in statistical machine translation. Word-aligned corpora have been found to be an excellent source of translation-related knowledge. We present...
Colin Cherry, Dekang Lin