We present a method for disambiguating syntactic subjects from syntactic objects (a frequent ambiguity) in German sentences taken from an English-German bitext. We exploit the fac...
Florian Schwarck, Alexander Fraser, Hinrich Schü...
Extraction of entities from ad creatives is an important problem that can benefit many computational advertising tasks. Supervised and semi-supervised solutions rely on labeled da...
We consider the problem of predicting a movie's opening weekend revenue. Previous work on this problem has used metadata about a movie--e.g., its genre, MPAA rating, and cast...
Mahesh Joshi, Dipanjan Das, Kevin Gimpel, Noah A. ...
Automatically finding email messages that contain requests for action can provide valuable assistance to users who otherwise struggle to give appropriate attention to the actionab...
We show how features can easily be added to standard generative models for unsupervised learning, without requiring complex new training methods. In particular, each component mul...
Taylor Berg-Kirkpatrick, Alexandre Bouchard-Cô...
We present Quantized Contour Modeling (QCM), a Bayesian approach to the classification of acoustic contours. We evaluate the performance of this technique in the classification of...
We describe a Bayesian inference algorithm that can be used to train any cascade of weighted finite-state transducers on end-to-end data. We also investigate the problem of automat...
David Chiang, Jonathan Graehl, Kevin Knight, Adam ...
We describe a utility evaluation to determine whether cross-document information extraction (IE) techniques measurably improve user performance in news summary writing. Two groups...
Heng Ji, Zheng Chen, Jonathan Feldman, Antonio Gon...
In this work, we try a hybrid methodology for language modeling where both morphological decomposition and factored language modeling (FLM) are exploited to deal with the complex ...
This paper analyzes the topic identification stage of single-document automatic text summarization across four different domains, consisting of newswire, literary, scientific and ...
Hakan Ceylan, Rada Mihalcea, Umut Özertem, Elena ...