Learning Summary Content Units with Topic Modeling

15 years 1 months ago

Download www.aclweb.org

In the field of multi-document summarization, the Pyramid method has become an important approach for evaluating machine-generated summaries. The method is based on the manual annotation of text spans with the same meaning in a set of human model summaries. In this paper, we present an unsupervised, probabilistic topic modeling approach for automatically identifying such semantically similar text spans. Our approach reveals some of the structure of model summaries and identifies topics that are good approximations of the Summary Content Units (SCU) used in the Pyramid method. Our results show that the topic model identifies topic-sentence associations that correspond to the contributors of SCUs, suggesting that the topic modeling approach can generate a viable set of candidate SCUs for facilitating the creation of Pyramids.

Leonhard Hennig, Ernesto William De Luca, Sahin Al

Real-time Traffic

COLING 2010 | Computational Linguistics | Model Identifies Topic-sentence | Pyramid Method | Summaries |

claim paper

» Topic and keyword reranking for LDAbased topic modeling

» QueryFocused Summaries or QueryBiased Summaries

» Hierarchical Orderings of Textual Units

» Learning From Collective Human Behavior to Introduce Diversity in Lexical Choice

» Evaluation of a Sentence Ranker for Text Summarization Based on Rogets Thesaurus

» AdaSum an adaptive model for summarization

» Topic Cube Topic Modeling for OLAP on Multidimensional Text Databases

» A TwoDimensional TopicAspect Model for Discovering MultiFaceted Topics

Post Info
More Details (n/a)

Added	13 May 2011
Updated	13 May 2011
Type	Journal
Year	2010
Where	COLING
Authors	Leonhard Hennig, Ernesto William De Luca, Sahin Albayrak

Comments (0)

Sciweavers

Learning Summary Content Units with Topic Modeling

COLING 2010 | Computational Linguistics | Model Identifies Topic-sentence | Pyramid Method | Summaries |

Explore & Download

Productivity Tools

Sciweavers