Sciweavers

ACL
2009
13 years 5 months ago
Phrase-Based Statistical Machine Translation as a Traveling Salesman Problem
An efficient decoding algorithm is a crucial element of any statistical machine translation system. Some researchers have noted certain similarities between SMT decoding and the f...
Mikhail Zaslavskiy, Marc Dymetman, Nicola Cancedda
ACL
2009
13 years 5 months ago
Cross Language Dependency Parsing using a Bilingual Lexicon
This paper proposes an approach to enhance dependency parsing in a language by using a translated treebank from another language. A simple statistical machine translation method, ...
Hai Zhao, Yan Song, Chunyu Kit, Guodong Zhou
ACL
2009
13 years 5 months ago
Reducing the Annotation Effort for Letter-to-Phoneme Conversion
Letter-to-phoneme (L2P) conversion is the process of producing a correct phoneme sequence for a word, given its letters. It is often desirable to reduce the quantity of training d...
Kenneth Dwyer, Grzegorz Kondrak
ACL
2009
13 years 5 months ago
Quadratic-Time Dependency Parsing for Machine Translation
Efficiency is a prime concern in syntactic MT decoding, yet significant developments in statistical parsing with respect to asymptotic efficiency haven't yet been explored in...
Michel Galley, Christopher D. Manning
ACL
2009
13 years 5 months ago
Learning to Tell Tales: A Data-driven Approach to Story Generation
Computational story telling has sparked great interest in artificial intelligence, partly because of its relevance to educational and gaming applications. Traditionally, story gen...
Neil McIntyre, Mirella Lapata
ACL
2009
13 years 5 months ago
Robust Machine Translation Evaluation with Entailment Features
Existing evaluation metrics for machine translation lack crucial robustness: their correlations with human quality judgments vary considerably across languages and genres. We beli...
Sebastian Padó, Michel Galley, Daniel Juraf...
ACL
2009
13 years 5 months ago
Setting Up User Action Probabilities in User Simulations for Dialog System Development
User simulations are shown to be useful in spoken dialog system development. Since most current user simulations deploy probability models to mimic human user behaviors, how to se...
Hua Ai, Diane J. Litman
ACL
2009
13 years 5 months ago
Knowing the Unseen: Estimating Vocabulary Size over Unseen Samples
Empirical studies on corpora involve making measurements of several quantities for the purpose of comparing corpora, creating language models or to make generalizations about spec...
Suma Bhat, Richard Sproat
ACL
2009
13 years 5 months ago
Variational Decoding for Statistical Machine Translation
Statistical models in machine translation exhibit spurious ambiguity. That is, the probability of an output string is split among many distinct derivations (e.g., trees or segment...
Zhifei Li, Jason Eisner, Sanjeev Khudanpur
ACL
2009
13 years 5 months ago
A non-contiguous Tree Sequence Alignment-based Model for Statistical Machine Translation
The tree sequence based translation model allows the violation of syntactic boundaries in a rule to capture non-syntactic phrases, where a tree sequence is a contiguous sequence o...
Jun Sun, Min Zhang, Chew Lim Tan