Sciweavers

EMNLP
2008
14 years 1 month ago
Mention Detection Crossing the Language Barrier
While significant effort has been put into annotating linguistic resources for several languages, there are still many left that have only small amounts of such resources. This p...
Imed Zitouni, Radu Florian
EMNLP
2008
14 years 1 month ago
Incorporating Temporal and Semantic Information with Eye Gaze for Automatic Word Acquisition in Multimodal Conversational System
One major bottleneck in conversational systems is their inability to interpret unexpected user language inputs such as out-of-vocabulary words. To overcome this problem, conv...
Shaolin Qu, Joyce Yue Chai
EMNLP
2008
14 years 1 month ago
HotSpots: Visualizing Edits to a Text
Compared to the telephone, email-based customer care is increasingly becoming the preferred channel of communication for corporations and customers. Most email-based customer care...
Srinivas Bangalore, David Smith
EMNLP
2008
14 years 2 months ago
A Dependency-based Word Subsequence Kernel
This paper introduces a new kernel which computes similarity between two natural language sentences as the number of paths shared by their dependency trees. The paper gives a very...
Rohit J. Kate
EMNLP
2008
14 years 2 months ago
N-gram Weighting: Reducing Training Data Mismatch in Cross-Domain Language Model Estimation
In domains with insufficient matched training data, language models are often constructed by interpolating component models trained from partially matched corpora. Since the ngram...
Bo-June Paul Hsu, James R. Glass
EMNLP
2008
14 years 2 months ago
Triplet Lexicon Models for Statistical Machine Translation
This paper describes a lexical trigger model for statistical machine translation. We present various methods using triplets incorporating long-distance dependencies that can go be...
Sasa Hasan, Juri Ganitkevitch, Hermann Ney, Jesú...
EMNLP
2008
14 years 2 months ago
Lattice-based Minimum Error Rate Training for Statistical Machine Translation
Minimum Error Rate Training (MERT) is an effective means to estimate the feature function weights of a linear model such that an automated evaluation criterion for measuring syste...
Wolfgang Macherey, Franz Josef Och, Ignacio Thayer...
EMNLP
2008
14 years 2 months ago
Scalable Language Processing Algorithms for the Masses: A Case Study in Computing Word Co-occurrence Matrices with MapReduce
This paper explores the challenge of scaling up language processing algorithms to increasingly large datasets. While cluster computing has been available in commercial environment...
Jimmy J. Lin
EMNLP
2008
14 years 2 months ago
Learning with Probabilistic Features for Improved Pipeline Models
We present a novel learning framework for pipeline models aimed at improving the communication between consecutive stages in a pipeline. Our method exploits the confidence scores ...
Razvan C. Bunescu