Topical query decomposition

We introduce the problem of query decomposition, where we are given a query and a document retrieval system, and we want to produce a small set of queries whose union of resulting documents corresponds approximately to that of the original query. Ideally, these queries should represent coherent, conceptually well-separated topics. We provide an abstract formulation of the query decomposition problem, and we tackle it from two different perspectives. We first show how the problem can be instantiated as a specific variant of a set cover problem, for which we provide an efficient greedy algorithm. Next, we show how the same problem can be seen as a constrained clustering problem, with a very particular kind of constraint, i.e., clustering with predefined clusters. We develop a two-phase algorithm based on hierarchical agglomerative clustering followed by dynamic programming. Our experiments, conducted on a set of actual queries in a Web-scale search engine, confirm the effectiveness of t...
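The set cover view described in the abstract can be illustrated with a small greedy sketch. This is not the authors' algorithm: the abstract does not give their objective, so the scoring rule here (newly covered documents minus documents outside the original result set) is an assumption, as are the function name `greedy_decomposition`, the example queries, and the document ids.

```python
from typing import Dict, List, Set


def greedy_decomposition(
    target: Set[str],
    candidates: Dict[str, Set[str]],
    max_queries: int,
) -> List[str]:
    """Greedily pick up to max_queries candidate queries to cover target.

    target: documents returned by the original query.
    candidates: candidate query -> documents it returns.
    The gain below is a hypothetical scoring rule, not taken from the paper.
    """
    uncovered = set(target)
    chosen: List[str] = []
    while uncovered and len(chosen) < max_queries:
        best, best_gain = None, 0
        # Sorted iteration makes tie-breaking deterministic.
        for query in sorted(candidates):
            if query in chosen:
                continue
            docs = candidates[query]
            # Reward newly covered documents, penalize off-target ones.
            gain = len(docs & uncovered) - len(docs - target)
            if gain > best_gain:
                best, best_gain = query, gain
        if best is None:
            break  # no remaining candidate has positive gain
        chosen.append(best)
        uncovered -= candidates[best]
    return chosen


# Made-up example data: results of the original query and of four candidates.
target = {"d1", "d2", "d3", "d4", "d5", "d6"}
candidates = {
    "jaguar car": {"d1", "d2", "d3"},
    "jaguar cat": {"d4", "d5"},
    "jaguar os": {"d5", "d6"},
    "big cats": {"d4", "d7", "d8", "d9"},
}
print(greedy_decomposition(target, candidates, max_queries=3))
# prints ['jaguar car', 'jaguar cat', 'jaguar os']
```

The candidate "big cats" is never selected because most of its results fall outside the original result set, which is one simple way to approximate the requirement that the union of results should correspond to that of the original query.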

Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis

Constrained Clustering Problem | Data Mining | KDD 2008 | Query Decomposition Problem | Set Cover Problem |

» Examining topic shifts in content-oriented XML retrieval

» Sentence Retrieval with LSI and Topic Identification

» From Whence Does Your Authority Come? Utilizing Community Relevance in Ranking

» Extracting Topics and Innovators Using Topic Diffusion Process in Weblogs

» Spotting Topics with the Singular Value Decomposition

» Automatic Keyphrase Extraction via Topic Decomposition

» Decomposition, discovery and detection of visual categories using topic models

» Probabilistic topic decomposition of an eighteenth-century American newspaper

Post Info

Added	30 Nov 2009
Updated	30 Nov 2009
Type	Conference
Year	2008
Where	KDD
Authors	Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis

Sciweavers
