Sciweavers

SIGIR
2008
ACM

ResIn: a combination of results caching and index pruning for high-performance web search engines

13 years 11 months ago
ResIn: a combination of results caching and index pruning for high-performance web search engines
Results caching is an efficient technique for reducing the query processing load, hence it is commonly used in real search engines. This technique, however, bounds the maximum hit rate due to the large fraction of singleton queries, which is an important limitation. In this paper we propose ResIn - an architecture that uses a combination of results caching and index pruning to overcome this limitation. We argue that results caching is an inexpensive and efficient way to reduce the query processing load and show that it is cheaper to implement compared to a pruned index. At the same time, we show that index pruning performance is fundamentally affected by the changes in the query traffic that the results cache induces. We experiment with real query logs and a large document collection, and show that the combination of both techniques enables efficient reduction of the query processing costs and thus is practical to use in Web search engines. Categories and Subject Descriptors: H.3.3 [I...
Gleb Skobeltsyn, Flavio Junqueira, Vassilis Placho
Added 28 Jan 2011
Updated 28 Jan 2011
Type Journal
Year 2008
Where SIGIR
Authors Gleb Skobeltsyn, Flavio Junqueira, Vassilis Plachouras, Ricardo A. Baeza-Yates
Comments (0)