Sciweavers

SIGIR
1999
ACM

Summarizing Text Documents: Sentence Selection and Evaluation Metrics

Human-quality text summarization systems are difficult to design, and even more difficult to evaluate, in part because documents can differ along several dimensions, such as length, writing style and lexical usage. Nevertheless, certain cues can often help suggest the selection of sentences for inclusion in a summary. This paper presents our analysis of news-article summaries generated by sentence selection. Sentences are ranked for potential inclusion in the summary using a weighted combination of statistical and linguistic features. The statistical features were adapted from standard IR methods. The potential linguistic ones were derived from an analysis of news-wire summaries. To evaluate these features we use a normalized version of precision-recall curves, with a baseline of random sentence selection, as well as analyze the properties of such a baseline. We illustrate our discussions with empirical results showing the importance of corpus-dependent baseline summarization standards, com...
Jade Goldstein, Mark Kantrowitz, Vibhu O. Mittal, Jaime G. Carbonell
Added: 03 Aug 2010
Updated: 03 Aug 2010
Type: Conference
Year: 1999
Where: SIGIR
Authors: Jade Goldstein, Mark Kantrowitz, Vibhu O. Mittal, Jaime G. Carbonell
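
The abstract describes ranking sentences by a weighted combination of features and comparing the selection against a random-selection baseline. The Python sketch below illustrates that general idea only; it is not the authors' implementation. The feature set (a term-frequency score plus a position score standing in for a linguistic cue), the weight names, and every function name are assumptions made for illustration, and the paper's own normalization of the precision-recall curves is not reproduced here.

```python
import re
from collections import Counter


def split_sentences(text):
    # Naive splitter: break on whitespace that follows '.', '!' or '?'.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    # Lowercased alphanumeric tokens.
    return re.findall(r"[a-z0-9]+", sentence.lower())


def rank_sentences(sentences, weights):
    """Return sentence indices ordered from highest to lowest score.

    `weights` is a hypothetical dict with keys "tf" and "position".
    """
    doc_tf = Counter(tok for s in sentences for tok in tokenize(s))
    max_tf = max(doc_tf.values()) if doc_tf else 1
    n = len(sentences)
    scored = []
    for i, s in enumerate(sentences):
        toks = tokenize(s)
        # Statistical feature: mean document frequency of the sentence's terms,
        # scaled to [0, 1].
        tf_score = sum(doc_tf[t] for t in toks) / (len(toks) * max_tf) if toks else 0.0
        # Assumed stand-in for a linguistic cue: earlier sentences score higher.
        position_score = 1.0 - i / n
        score = weights["tf"] * tf_score + weights["position"] * position_score
        scored.append((score, i))
    # Highest score first; ties are broken by original position.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in scored]


def precision_recall(selected, relevant):
    # Precision and recall of selected sentence indices against a reference set.
    selected, relevant = set(selected), set(relevant)
    hits = len(selected & relevant)
    precision = hits / len(selected) if selected else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


def random_baseline_precision(n_sentences, n_relevant):
    # Expected precision when sentences are chosen uniformly at random.
    return n_relevant / n_sentences if n_sentences else 0.0
```

A possible use, under the same assumptions: call `rank_sentences(split_sentences(text), {"tf": 0.7, "position": 0.3})`, keep the top few indices as the summary, and compare the result of `precision_recall` with `random_baseline_precision` for the same document to see how much the ranking improves on random selection.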