Sciweavers

ICDE
2011
IEEE

Memory-constrained aggregate computation over data streams

13 years 2 months ago
Memory-constrained aggregate computation over data streams
— In this paper, we study the problem of efficiently computing multiple aggregation queries over a data stream. In order to share computation, prior proposals have suggested instantiating certain intermediate aggregates which are then used to generate the final answers for input queries. In this work, we make a number of important contributions aimed at improving the execution and generation of query plans containing intermediate aggregates. These include: (1) a different hashing model, which has low eviction rates, and also allows us to accurately estimate the number of evictions, (2) a comprehensive query execution cost model based on these estimates, (3) an efficient greedy heuristic for constructing good low-cost query plans, (4) provably near-optimal and optimal algorithms for allocating the available memory to aggregates in the query plan when the input data distribution is Zipf-like and Uniform, respectively, and (5) a detailed performance study with real-life IP flow data...
K. V. M. Naidu, Rajeev Rastogi, Scott Satkin, Anan
Added 29 Aug 2011
Updated 29 Aug 2011
Type Journal
Year 2011
Where ICDE
Authors K. V. M. Naidu, Rajeev Rastogi, Scott Satkin, Anand Srinivasan
Comments (0)