Full-Subtopic Retrieval with Keyphrase-Based Search Results Clustering


We consider the problem of retrieving multiple documents relevant to the single subtopics of a given web query, termed “full-subtopic retrieval”. To solve this problem we present a novel search results clustering algorithm that generates clusters labeled by keyphrases. The keyphrases are extracted from the generalized suffix tree built from the search results and merged through an improved hierarchical agglomerative clustering procedure. We also introduce a novel measure for evaluating full-subtopic retrieval performance, namely “Subtopic Search Length under k document sufficiency”. Using a test collection specifically designed for evaluating subtopic retrieval, we found that our algorithm outperformed both other existing search results clustering algorithms and also a search results re-ranking method that emphasized diversity of results (at least for k>1; i.e., when we are interested in retrieving more than one relevant document per subtopic). Our approach has been impl...
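
The pipeline outlined in the abstract (shared phrases taken from the search results, then merged agglomeratively into labeled clusters) can be illustrated with a rough sketch. This is not the authors' algorithm: it enumerates word n-grams by brute force in place of a generalized suffix tree, and it uses a plain greedy merge with a Jaccard threshold in place of the paper's improved hierarchical agglomerative clustering. The function names, `max_len`, `min_docs`, and the 0.6 threshold are all hypothetical choices made for illustration.

```python
from collections import defaultdict
from itertools import combinations


def candidate_phrases(snippets, max_len=3, min_docs=2):
    """Map each word n-gram (1..max_len words) to the ids of snippets containing it.

    Assumption: brute-force n-gram enumeration stands in for reading shared
    phrases off a generalized suffix tree; only phrases occurring in at
    least min_docs snippets are kept.
    """
    phrase_docs = defaultdict(set)
    for doc_id, text in enumerate(snippets):
        words = text.lower().split()
        for n in range(1, max_len + 1):
            for i in range(len(words) - n + 1):
                phrase_docs[" ".join(words[i:i + n])].add(doc_id)
    return {p: d for p, d in phrase_docs.items() if len(d) >= min_docs}


def jaccard(a, b):
    """Overlap of two document-id sets."""
    return len(a & b) / len(a | b)


def merge_clusters(phrase_docs, threshold=0.6):
    """Greedily merge the two most-overlapping phrase clusters until no
    pair reaches the (hypothetical) Jaccard threshold.

    Returns a list of (label_phrases, doc_ids) pairs.
    """
    clusters = [({p}, set(d)) for p, d in sorted(phrase_docs.items())]
    while len(clusters) > 1:
        # Score every pair of clusters and pick the most similar one.
        (i, j), best = max(
            (((i, j), jaccard(clusters[i][1], clusters[j][1]))
             for i, j in combinations(range(len(clusters)), 2)),
            key=lambda pair: pair[1],
        )
        if best < threshold:
            break
        merged = (clusters[i][0] | clusters[j][0], clusters[i][1] | clusters[j][1])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters
```

Calling `merge_clusters(candidate_phrases(snippets))` on a list of result snippets returns clusters whose labels are the phrases their documents share. A suffix tree would find the same shared phrases in linear time rather than by enumerating every n-gram.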

Andrea Bernardini, Claudio Carpineto, Massimiliano D'Amico


Full-subtopic Retrieval | Hierarchical Agglomerative Clustering | Internet Technology | Subtopic Search Length | WEBI 2009


» New Research Directions in Search Results Clustering

» Optimal meta search results clustering

» IGroup a web image search engine with semantic clustering of search results

» Largescale analysis of individual and task differences in search result page examination s...

» A language for manipulating clustered web documents results

» Semantic Hierarchical Online Clustering of Web Search Results

» Web Search Results Clustering in Polish Experimental Evaluation of Carrot

» Learning to cluster web search results

Post Info

Added	25 May 2010
Updated	25 May 2010
Type	Conference
Year	2009
Where	WEBI
Authors	Andrea Bernardini, Claudio Carpineto, Massimiliano D'Amico
