Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization

15 years 7 months ago

Download users.cis.fiu.edu

Multi-document summarization aims to create a compressed summary while retaining the main characteristics of the original set of documents. Many approaches use statistics and machine learning techniques to extract sentences from documents. In this paper, we propose a new multi-document summarization framework based on sentence-level semantic analysis and symmetric non-negative matrix factorization. We first calculate sentence-sentence similarities using semantic analysis and construct the similarity matrix. Then symmetric matrix factorization, which has been shown to be equivalent to normalized spectral clustering, is used to group sentences into clusters. Finally, the most informative sentences are selected from each group to form the summary. Experimental results on DUC2005 and DUC2006 data sets demonstrate the improvement of our proposed framework over the implemented existing summarization systems. A further study on the factors that benefit the high performance is also conducted....

Dingding Wang, Tao Li, Shenghuo Zhu, Chris H. Q. D

Real-time Traffic

Information Technology | Matrix Factorization | Multi-document Summarization | Semantic Analysis | SIGIR 2008 |

claim paper

Post Info
More Details (n/a)

Added	15 Dec 2010
Updated	15 Dec 2010
Type	Journal
Year	2008
Where	SIGIR
Authors	Dingding Wang, Tao Li, Shenghuo Zhu, Chris H. Q. Ding

Comments (0)

Sciweavers

Multi-document summarization via sentence-level semantic analysis and symmetric matrix factorization

Information Technology | Matrix Factorization | Multi-document Summarization | Semantic Analysis | SIGIR 2008 |

Explore & Download

Productivity Tools

Sciweavers