ResIn: a combination of results caching and index pruning for high-performance web search engines

13 years 12 months ago

Download users.sch.gr

Results caching is an efficient technique for reducing the query processing load, hence it is commonly used in real search engines. This technique, however, bounds the maximum hit rate due to the large fraction of singleton queries, which is an important limitation. In this paper we propose ResIn - an architecture that uses a combination of results caching and index pruning to overcome this limitation. We argue that results caching is an inexpensive and efficient way to reduce the query processing load and show that it is cheaper to implement compared to a pruned index. At the same time, we show that index pruning performance is fundamentally affected by the changes in the query traffic that the results cache induces. We experiment with real query logs and a large document collection, and show that the combination of both techniques enables efficient reduction of the query processing costs and thus is practical to use in Web search engines. Categories and Subject Descriptors: H.3.3 [I...

Gleb Skobeltsyn, Flavio Junqueira, Vassilis Placho

Real-time Traffic

Information Technology | Query Processing | Query Processing Load | Search Engines | SIGIR 2008 |

claim paper

Post Info
More Details (n/a)

Added	28 Jan 2011
Updated	28 Jan 2011
Type	Journal
Year	2008
Where	SIGIR
Authors	Gleb Skobeltsyn, Flavio Junqueira, Vassilis Plachouras, Ricardo A. Baeza-Yates

Comments (0)

Sciweavers

ResIn: a combination of results caching and index pruning for high-performance web search engines

Information Technology | Query Processing | Query Processing Load | Search Engines | SIGIR 2008 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers