Sciweavers

EMNLP
2008
14 years 29 days ago
Phrase Translation Probabilities with ITG Priors and Smoothing as Learning Objective
The conditional phrase translation probabilities constitute the principal components of phrase-based machine translation systems. These probabilities are estimated using a heurist...
Markos Mylonakis, Khalil Sima'an
EMNLP
2008
14 years 29 days ago
Bayesian Unsupervised Topic Segmentation
This paper describes a novel Bayesian approach to unsupervised topic segmentation. Unsupervised systems for this task are driven by lexical cohesion: the tendency of well-formed se...
Jacob Eisenstein, Regina Barzilay
EMNLP
2008
14 years 29 days ago
Improving Interactive Machine Translation via Mouse Actions
Although Machine Translation (MT) is a very active research field which is receiving an increasing amount of attention from the research community, the results that current MT sys...
Germán Sanchis-Trilles, Daniel Ortiz-Martí...
EMNLP
2008
14 years 29 days ago
Better Binarization for the CKY Parsing
We present a study on how grammar binarization empirically affects the efficiency of the CKY parsing. We argue that binarizations affect parsing efficiency primarily by affecting ...
Xinying Song, Shilin Ding, Chin-Yew Lin
EMNLP
2008
14 years 29 days ago
Learning to Predict Code-Switching Points
Predicting possible code-switching points can help develop more accurate methods for automatically processing mixed-language text, such as multilingual language models for speech ...
Thamar Solorio, Yang Liu
EMNLP
2008
14 years 29 days ago
Maximum Entropy based Rule Selection Model for Syntax-based Statistical Machine Translation
This paper proposes a novel maximum entropy based rule selection (MERS) model for syntax-based statistical machine translation (SMT). The MERS model combines local contextual info...
Qun Liu, Zhongjun He, Yang Liu, Shouxun Lin
EMNLP
2008
14 years 29 days ago
Two Languages are Better than One (for Syntactic Parsing)
We show that jointly parsing a bitext can substantially improve parse quality on both sides. In a maximum entropy bitext parsing model, we define a distribution over source trees,...
David Burkett, Dan Klein
EMNLP
2008
14 years 29 days ago
Refining Generative Language Models using Discriminative Learning
We propose a new approach to language modeling which utilizes discriminative learning methods. Our approach is an iterative one: starting with an initial language model, in each i...
Ben Sandbank
EMNLP
2008
14 years 29 days ago
Cheap and Fast - But is it Good? Evaluating Non-Expert Annotations for Natural Language Tasks
Human linguistic annotation is crucial for many natural language processing tasks but can be expensive and time-consuming. We explore the use of Amazon's Mechanical Turk syst...
Rion Snow, Brendan O'Connor, Daniel Jurafsky, Andr...
EMNLP
2008
14 years 29 days ago
Multimodal Subjectivity Analysis of Multiparty Conversation
We investigate the combination of several sources of information for the purpose of subjectivity recognition and polarity classification in meetings. We focus on features from two...
Stephan Raaijmakers, Khiet P. Truong, Theresa Wils...