
14 years 1 month ago
A Web-Trained Extraction Summarization System
A serious bottleneck in the development of trainable text summarization systems is the shortage of training data. Constructing such data is a very tedious task, especially because...
Liang Zhou, Eduard H. Hovy
14 years 1 month ago
Identifying Opinionated Sentences
Theresa Wilson, David R. Pierce, Janyce Wiebe
14 years 1 month ago
Language choice models for microplanning and readability
This paper describes the construction of language choice models for the microplanning of discourse relations in a Natural Language Generation system that attempts to generate appr...
Sandra Williams
14 years 1 month ago
Monolingual and Bilingual Concept Visualization from Corpora
... by placing terms in an abstract ‘information space’ based on their occurrences in text corpora, and then allowing a user to visualize local regions of this information space....
Dominic Widdows, Scott Cederberg
14 years 1 month ago
Unsupervised methods for developing taxonomies by combining syntactic and statistical information
This paper describes an unsupervised algorithm for placing unknown words into a taxonomy and evaluates its accuracy on a large and varied sample of words. The algorithm works by ...
Dominic Widdows
14 years 1 month ago
Speechalator: Two-Way Speech-to-Speech Translation in Your Hand
This demonstration involves two-way automatic speech-to-speech translation on a consumer off-the-shelf PDA. This work was done as part of the DARPA-funded Babylon project, investig...
Alex Waibel, Ahmed Badran, Alan W. Black, Robert E...
14 years 1 month ago
Toward a Task-based Gold Standard for Evaluation of NP Chunks and Technical Terms
We propose a gold standard for evaluating two types of information extraction output -- noun phrase (NP) chunks (Abney 1991; Ramshaw and Marcus 1995) and technical terms (Justeson...
Nina Wacholder, Peng Song
14 years 1 month ago
Evaluating the Evaluation: A Case Study Using the TREC 2002 Question Answering Track
Evaluating competing technologies on a common problem set is a powerful way to improve the state of the art and hasten technology transfer. Yet poorly designed evaluations can was...
Ellen M. Voorhees
14 years 1 month ago
Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network
We present a new part-of-speech tagger that demonstrates the following ideas: (i) explicit use of both preceding and following tag contexts via a dependency network representation...
Kristina Toutanova, Dan Klein, Christopher D. Mann...