Sciweavers

EMNLP
2010
13 years 10 months ago
A Multi-Pass Sieve for Coreference Resolution
Most coreference resolution models determine if two mentions are coreferent using a single function over a set of constraints or features. This approach can lead to incorrect deci...
Karthik Raghunathan, Heeyoung Lee, Sudarshan Ranga...
EMNLP
2010
13 years 10 months ago
Incorporating Content Structure into Text Analysis Applications
In this paper, we investigate how modeling content structure can benefit text analysis applications such as extractive summarization and sentiment analysis. This follows the lingu...
Christina Sauper, Aria Haghighi, Regina Barzilay
EMNLP
2010
13 years 10 months ago
Cross Language Text Classification by Model Translation and Semi-Supervised Learning
In this paper, we introduce a method that automatically builds text classifiers in a new language by training on already labeled data in another language. Our method transfers the...
Lei Shi, Rada Mihalcea, Mingjun Tian
EMNLP
2010
13 years 10 months ago
Minimum Error Rate Training by Sampling the Translation Lattice
Minimum Error Rate Training is the algorithm for log-linear model parameter training most used in state-of-the-art Statistical Machine Translation systems. In its original formula...
Samidh Chatterjee, Nicola Cancedda
EMNLP
2010
13 years 10 months ago
Hierarchical Phrase-Based Translation Grammars Extracted from Alignment Posterior Probabilities
We report on investigations into hierarchical phrase-based translation grammars based on rules extracted from posterior distributions over alignments of the parallel text. Rather ...
Adrià de Gispert, Juan Pino, William J. Byr...
EMNLP
2010
13 years 10 months ago
The Necessity of Combining Adaptation Methods
Problems stemming from domain adaptation continue to plague the statistical natural language processing community. There has been continuing work trying to find general purpose al...
Ming-Wei Chang, Michael Connor, Dan Roth
EMNLP
2010
13 years 10 months ago
A Hybrid Morpheme-Word Representation for Machine Translation of Morphologically Rich Languages
We propose a language-independent approach for improving statistical machine translation for morphologically rich languages using a hybrid morpheme-word representation where the b...
Minh-Thang Luong, Preslav Nakov, Min-Yen Kan
EMNLP
2010
13 years 10 months ago
Holistic Sentiment Analysis Across Languages: Multilingual Supervised Latent Dirichlet Allocation
In this paper, we develop multilingual supervised latent Dirichlet allocation (MLSLDA), a probabilistic generative model that allows insights gleaned from one language's data...
Jordan L. Boyd-Graber, Philip Resnik
EMNLP
2010
13 years 10 months ago
Summarizing Contrastive Viewpoints in Opinionated Text
This paper presents a two-stage approach to summarizing multiple contrastive viewpoints in opinionated text. In the first stage, we use an unsupervised probabilistic approach to m...
Michael Paul, ChengXiang Zhai, Roxana Girju