Sciweavers

48 search results - page 3 / 10
» Unsupervised Tokenization for Machine Translation
ICASSP
2008
IEEE
14 years 4 months ago
Language modeling for voice search: A machine translation approach
This paper presents a novel approach to language modeling for voice search based on the idea and method of statistical machine translation. We propose an n-gram based translation ...
Xiao Li, Yun-Cheng Ju, Geoffrey Zweig, Alex Acero
CICLING
2009
Springer
14 years 10 months ago
Enriching Statistical Translation Models Using a Domain-Independent Multilingual Lexical Knowledge Base
This paper presents a method for improving phrase-based Statistical Machine Translation systems by enriching the original translation model with information derived from a multilin...
Miguel García, Jesús Giménez,...
ACL
2008
13 years 11 months ago
Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation
In statistical language modeling, one technique to reduce the problematic effects of data sparsity is to partition the vocabulary into equivalence classes. In this paper we invest...
Jakob Uszkoreit, Thorsten Brants
EMNLP
2007
13 years 11 months ago
Large Language Models in Machine Translation
This paper reports on the benefits of large-scale statistical language modeling in machine translation. A distributed infrastructure is proposed which we use to train on up to 2 t...
Thorsten Brants, Ashok C. Popat, Peng Xu, Franz Jo...
FINTAL
2006
14 years 1 month ago
Using Alignment Templates to Infer Shallow-Transfer Machine Translation Rules
When building rule-based machine translation systems, a considerable human effort is needed to code the transfer rules that are able to translate source-language sentences into gra...
Felipe Sánchez-Martínez, Hermann Ney