Selectivity Estimation of High Dimensional Window Queries via Clustering

14 years 10 months ago

Download www.dbs.informatik.uni-muenchen.de

Abstract. Query optimization is an important functionality of modern database systems and often based on estimating the selectivity of queries before actually executing them. Well-known techniques for estimating the result set size of a query are sampling and histogram-based solutions. Sampling-based approaches heavily depend on the size of the drawn sample which causes a trade-oﬀ between the quality of the estimation and the time in which the estimation can be executed for large data sets. Histogram-based techniques eliminate this problem but are limited to low-dimensional data sets. They either assume that all attributes are independent which is rarely true for real-world data or else get very ineﬃcient for high-dimensional data. In this paper we present the ﬁrst multivariate parametric method for estimating the selectivity of window queries for large and high-dimensional data sets. We use clustering to compress the data by generating a precise model of the data using multivari...

Christian Böhm, Hans-Peter Kriegel, Peer Kr&o

Real-time Traffic

Data Sets | Large Data Sets | Low-dimensional Data Sets | SSD 2005 |

claim paper

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	SSD
Authors	Christian Böhm, Hans-Peter Kriegel, Peer Kröger, Petra Linhart

Sciweavers

Selectivity Estimation of High Dimensional Window Queries via Clustering

Data Sets | Large Data Sets | Low-dimensional Data Sets | SSD 2005 |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers