Sciweavers

281 search results - page 34 / 57
» Introducing the Enron Corpus
ACL
2009
13 years 5 months ago
Active Learning for Multilingual Statistical Machine Translation
Statistical machine translation (SMT) models require bilingual corpora for training, and these corpora are often multilingual with parallel text in multiple languages simultaneous...
Gholamreza Haffari, Anoop Sarkar
ACL
2009
13 years 5 months ago
Confidence Measure for Word Alignment
In this paper we present a confidence measure for word alignment based on the posterior probability of alignment links. We introduce a sentence alignment confidence measure and alig...
Fei Huang
ACL
2009
13 years 5 months ago
Hybrid Approach to User Intention Modeling for Dialog Simulation
This paper proposes a novel user intention simulation method that is data-driven yet able to integrate diverse user discourse knowledge to simulate various ty...
Sangkeun Jung, Cheongjae Lee, Kyungduk Kim, Gary G...
EMNLP
2009
13 years 5 months ago
Multi-Class Confidence Weighted Algorithms
The recently introduced online confidence-weighted (CW) learning algorithm for binary classification performs well on many binary NLP tasks. However, for multi-class problems CW l...
Koby Crammer, Mark Dredze, Alex Kulesza
ICWSM
2009
13 years 5 months ago
Regression-Based Summarization of Email Conversations
In this paper we present a regression-based machine learning approach to email thread summarization. The regression model is able to take advantage of multiple gold-standard annot...
Jan Ulrich, Giuseppe Carenini, Gabriel Murray, Ray...