Sciweavers

EMNLP
2009
13 years 5 months ago
Unsupervised Tokenization for Machine Translation
Training a statistical machine translation system starts with tokenizing a parallel corpus. Some languages such as Chinese do not incorporate spacing in their writing system, which creat...
Tagyoung Chung, Daniel Gildea
ACL
2010
13 years 5 months ago
Unsupervised Search for the Optimal Segmentation for Statistical Machine Translation
We tackle the previously unaddressed problem of unsupervised determination of the optimal morphological segmentation for statistical machine translation (SMT) and propose a segmen...
Coşkun Mermer, Ahmet Afşın Akın
NLE
2007
13 years 7 months ago
Segmentation and alignment of parallel text for statistical machine translation
We address the problem of extracting bilingual chunk pairs from parallel text to create training sets for statistical machine translation. We formulate the problem in terms of a s...
Yonggang Deng, Shankar Kumar, William Byrne
COLING
2008
13 years 9 months ago
Linguistically Annotated BTG for Statistical Machine Translation
Bracketing Transduction Grammar (BTG) is a natural choice for effective integration of desired linguistic knowledge into statistical machine translation (SMT). In this paper, we p...
Deyi Xiong, Min Zhang, AiTi Aw, Haizhou Li
IUI
2010
ACM
14 years 4 months ago
Interactive machine translation using a web-based architecture
In this paper we present a new way of translating documents by using a Web-based system. An interactive approach is proposed as an alternative to post-editing the output of a mach...
Daniel Ortiz-Martínez, Luis A. Leiva, Vicen...