Sciweavers

SIGIR
2011
ACM

Faster temporal range queries over versioned text

13 years 2 months ago
Faster temporal range queries over versioned text
Versioned textual collections are collections that retain multiple versions of a document as it evolves over time. Important large-scale examples are Wikipedia and the web collection of the Internet Archive. Search queries over such collections often use keywords as well as temporal constraints, most commonly a time range of interest. In this paper, we study how to support such temporal range queries over versioned text. Our goal is to process these queries faster than the corresponding keyword-only queries, by exploiting the additional constraint. A simple approach might partition the index into different time ranges, and then access only the relevant parts. However, specialized inverted index compression techniques are crucial for large versioned collections, and a naive partitioning can negatively affect index compression and query throughput. We show how to achieve high query throughput by using smart index partitioning techniques that take index compression into account. Experi...
Jinru He, Torsten Suel
Added 17 Sep 2011
Updated 17 Sep 2011
Type Journal
Year 2011
Where SIGIR
Authors Jinru He, Torsten Suel
Comments (0)