Kernel-based skyline cardinality estimation

15 years 1 months ago

Download www.cse.ust.hk

The skyline of a d-dimensional dataset consists of all points not dominated by others. The incorporation of the skyline operator into practical database systems necessitates an efficient and effective cardinality estimation module. However, existing theoretical work on this problem is limited to the case where all d dimensions are independent of each other, which rarely holds for real datasets. The state of the art Log Sampling (LS) technique simply applies theoretical results for independent dimensions to non-independent data anyway, sometimes leading to large estimation errors. To solve this problem, we propose a novel Kernel-Based (KB) approach that approximates the skyline cardinality with nonparametric methods. Extensive experiments with various real datasets demonstrate that KB achieves high accuracy, even in cases where LS fails. At the same time, despite its numerical nature, the efficiency of KB is comparable to that of LS. Furthermore, we extend both LS and KB to the k-domin...

Zhenjie Zhang, Yin Yang, Ruichu Cai, Dimitris Papa

Real-time Traffic

Cardinality Estimation Module | Conventional Skyline | Database | SIGMOD 2009 | Skyline Cardinality |

claim paper

Post Info
More Details (n/a)

Added	05 Dec 2009
Updated	05 Dec 2009
Type	Conference
Year	2009
Where	SIGMOD
Authors	Zhenjie Zhang, Yin Yang, Ruichu Cai, Dimitris Papadias, Anthony K. H. Tung

Comments (0)

Sciweavers

Kernel-based skyline cardinality estimation

Cardinality Estimation Module | Conventional Skyline | Database | SIGMOD 2009 | Skyline Cardinality |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers