Sciweavers

AI
2004
Springer

Term-Based Clustering and Summarization of Web Page Collections

14 years 4 months ago
Term-Based Clustering and Summarization of Web Page Collections
Effectively summarizing Web page collections becomes more and more critical as the amount of information continues to grow on the World Wide Web. A concise and meaningful summary of a Web page collection, which is generated automatically, can help Web users understand the essential topics and main contents covered in the collection quickly without spending much browsing time. However, automatically generating coherent summaries as good as human-authored summaries is a challenging task since Web page collections often contain diverse topics and contents. This research aims towards clustering of Web page collections using automatically extracted topical terms, and automatic summarization of the resulting clusters. We experiment with word- and term-based representations of Web documents and demonstrate that term-based clustering significantly outperforms word-based clustering with much lower dimensionality. The summaries of computed clusters are informative and meaningful, which indicat...
Yongzheng Zhang, A. Nur Zincir-Heywood, Evangelos
Added 30 Jun 2010
Updated 30 Jun 2010
Type Conference
Year 2004
Where AI
Authors Yongzheng Zhang, A. Nur Zincir-Heywood, Evangelos E. Milios
Comments (0)