NAACL
2003

A Web-Trained Extraction Summarization System

A serious bottleneck in the development of trainable text summarization systems is the shortage of training data. Constructing such data is a very tedious task, especially because there are, in general, many different correct ways to summarize a text. Fortunately, we can utilize the Internet as a source of suitable training data. In this paper, we present a summarization system that uses the web as the source of training data. The procedure involves structuring the articles downloaded from various websites, building adequate corpora of (summary, text) and (extract, text) pairs, training on positive and negative data, and automatically learning to perform the task of extraction-based summarization at a level comparable to the best DUC systems.
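The step of turning (summary, text) pairs into (extract, text) pairs with positive and negative sentences can be illustrated with a minimal sketch. The abstract does not say how the alignment is done, so everything here is an assumption made for illustration: the word-overlap (Jaccard) score, the 0.5 threshold, and the function names `tokenize`, `jaccard`, `label_sentences`, and `build_extract` are hypothetical and not taken from the paper.

```python
import re


def tokenize(sentence):
    """Lowercase a sentence and return its set of word tokens."""
    return set(re.findall(r"[a-z0-9]+", sentence.lower()))


def jaccard(a, b):
    """Jaccard overlap between two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def label_sentences(text_sentences, summary_sentences, threshold=0.5):
    """Pair each text sentence with a label: 1 = positive, 0 = negative.

    Assumption: a text sentence counts as positive when its best word
    overlap with any summary sentence reaches the threshold.
    """
    summary_tokens = [tokenize(s) for s in summary_sentences]
    labeled = []
    for sentence in text_sentences:
        tokens = tokenize(sentence)
        best = max((jaccard(tokens, s) for s in summary_tokens), default=0.0)
        labeled.append((sentence, 1 if best >= threshold else 0))
    return labeled


def build_extract(text_sentences, summary_sentences, threshold=0.5):
    """Return the positive sentences, i.e. the extract for an (extract, text) pair."""
    labeled = label_sentences(text_sentences, summary_sentences, threshold)
    return [sentence for sentence, label in labeled if label == 1]


# Toy data, invented for this sketch.
text = [
    "The council approved the new budget on Monday.",
    "Rain is expected later this week.",
    "Local schools will receive more funding.",
]
summary = [
    "Council approved the new budget.",
    "Schools will receive more funding.",
]

labels = [label for _, label in label_sentences(text, summary)]
extract = build_extract(text, summary)
```

The labeled sentences could then serve as positive and negative training examples for any sentence classifier; the choice of classifier and features is left open here because the abstract does not specify them.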

Liang Zhou, Eduard H. Hovy

Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	NAACL
Authors	Liang Zhou, Eduard H. Hovy