Sciweavers

ACL
2006

Topic-Focused Multi-Document Summarization Using an Approximate Oracle Score

14 years 1 months ago
Topic-Focused Multi-Document Summarization Using an Approximate Oracle Score
We consider the problem of producing a multi-document summary given a collection of documents. Since most successful methods of multi-document summarization are still largely extractive, in this paper, we explore just how well an extractive method can perform. We introduce an "oracle" score, based on the probability distribution of unigrams in human summaries. We then demonstrate that with the oracle score, we can generate extracts which score, on average, better than the human summaries, when evaluated with ROUGE. In addition, we introduce an approximation to the oracle score which produces a system with the best known performance for the 2005 Document Understanding Conference (DUC) evaluation.
John M. Conroy, Judith D. Schlesinger, Dianne P. O
Added 30 Oct 2010
Updated 30 Oct 2010
Type Conference
Year 2006
Where ACL
Authors John M. Conroy, Judith D. Schlesinger, Dianne P. O'Leary
Comments (0)