Optimizing index for taxonomy keyword search

13 years 9 months ago

Download www.cs.uiuc.edu

Query substitution is an important problem in information retrieval. Much work focuses on how to ﬁnd substitutes for any given query. In this paper, we study how to eﬃciently process a keyword query whose substitutes are deﬁned by a given taxonomy. This problem is challenging because each term in a query can have a large number of substitutes, and the original query can be rewritten into any of their combinations. We propose to build an additional index (besides inverted index) to eﬃciently process queries. For a query workload, we formulate an optimization problem which chooses the additional index structure, aiming at minimizing the query evaluation cost, under given index space constraints. We show the NP-hardness of the problem, and propose a pseudo-polynomial time algorithm using dynamic programming, as well as an 1 4 (1−1/e)-approximation algorithm to solve the problem. Experimental results show that, with only 10% additional index space, our approach can greatly reduc...

Bolin Ding, Haixun Wang, Ruoming Jin, Jiawei Han,

Real-time Traffic

Approximation Algorithm | Army Research Laboratory | Database | Polynomial Time Algorithm | SIGMOD 2012 |

claim paper

» Keyword Proximity Search on XML Graphs

» Searching large indexes on tiny devices optimizing binary search with character pinning

» Topk Exploration of Query Candidates for Efficient Keyword Search on GraphShaped RDF Data

» A Taxonomy of JavaScript Redirection Spam

» Automatically learning document taxonomies for hierarchical classification

» Using taxonomies for contentbased routing with ants

» Multidimensional keywordbased image annotation and search

» On the Feasibility of PeertoPeer Web Indexing and Search

Post Info
More Details (n/a)

Added	27 Sep 2012
Updated	27 Sep 2012
Type	Journal
Year	2012
Where	SIGMOD
Authors	Bolin Ding, Haixun Wang, Ruoming Jin, Jiawei Han, Zhongyuan Wang

Comments (0)

Sciweavers

Optimizing index for taxonomy keyword search

Approximation Algorithm | Army Research Laboratory | Database | Polynomial Time Algorithm | SIGMOD 2012 |

Explore & Download

Productivity Tools

Sciweavers