Sciweavers

SSDBM
2007
IEEE

Reservoir Sampling over Memory-Limited Stream Joins

14 years 5 months ago
Reservoir Sampling over Memory-Limited Stream Joins
In stream join processing with limited memory, uniform random sampling is useful for approximate query evaluation. In this paper, we address the problem of reservoir sampling over memory-limited stream joins. We present two sampling algorithms, Reservoir Join-Sampling (RJS) and Progressive Reservoir Join-Sampling (PRJS). RJS is designed straightforwardly by using a fixed-size reservoir sampling on a join-sample (i.e., random sample of a join output stream). Anytime the sample in the reservoir is used, RJS always gives a uniform random sample of the original join output stream. With limited memory, however, the available memory may not be large enough even for the join buffer, thereby severely limiting the reservoir size. PRJS alleviates this problem by increasing the reservoir size during the join-sampling 1 . This increasing is possible since the memory requirement by the join-sampling algorithm decreases over time. A larger reservoir provides a closer representation of the original...
Mohammed Al-Kateb, Byung Suk Lee, Xiaoyang Sean Wa
Added 04 Jun 2010
Updated 04 Jun 2010
Type Conference
Year 2007
Where SSDBM
Authors Mohammed Al-Kateb, Byung Suk Lee, Xiaoyang Sean Wang
Comments (0)