Summarization- and learning-based approaches to information distillation

14 years 21 days ago

Download www.icsi.berkeley.edu

Information distillation is the task that aims to extract relevant passages of text from massive volumes of textual and audio sources, given a query. In this paper, we investigate two perspectives that use shallow language processing for answering open-ended distillation queries, such as “List me facts about [event]”. The rst approach is a summarization-based approach that uses the unsupervised maximum marginal relevance (MMR) technique to successfully capture relevant but not redundant information. The second approach is based on supervised classi cation and trains support vector machines (SVMs) to discriminate relevant snippets from irrelevant snippets using a variety of features. Furthermore, we investigate the merit of using the ROUGE metric for its ability to evaluate redundancy alongside the conventionally used F-measure for evaluating distillation systems. Our experimental results with textual data indicate that SVM and MMR perform similarly in terms of ROUGE-2 scores while...

Boriska Toth, Dilek Hakkani-Tür, Sibel Yaman

Real-time Traffic

Distillation | ICASSP 2010 | Information Distillation | Maximum Marginal Relevance | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Boriska Toth, Dilek Hakkani-Tür, Sibel Yaman

Comments (0)

Sciweavers

Summarization- and learning-based approaches to information distillation

Distillation | ICASSP 2010 | Information Distillation | Maximum Marginal Relevance | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers