Linked Bernoulli Synopses: Sampling along Foreign Keys



Random sampling is a popular technique for providing fast approximate query answers, especially in data warehouse environments. Compared to other types of synopses, random sampling bears the advantage of retaining the dataset’s dimensionality; it also associates probabilistic error bounds with the query results. Most of the available sampling techniques focus on table-level sampling, that is, they produce a sample of only a single database table. Queries that contain joins over multiple tables cannot be answered with such samples because join results on random samples are often small and skewed. In contrast, schema-level sampling techniques by design support queries containing joins. In this paper, we introduce Linked Bernoulli Synopses, a schema-level sampling scheme based upon the well-known Join Synopses. Both schemes rely on the idea of maintaining foreign-key integrity in the synopses; they are therefore suited to process queries containing arbitrary foreign-key joins. In con...
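The point about joins over independent table-level samples can be illustrated with a minimal Python sketch. It is not the Linked Bernoulli Synopses algorithm from the paper; it only contrasts independent Bernoulli sampling of two tables with a sample that keeps every row referenced by a sampled foreign key, which is the foreign-key-integrity idea the abstract attributes to both schemes. The `orders` and `customers` tables, their columns, and the sampling rate `q` are hypothetical and chosen purely for illustration.

```python
import random

def bernoulli_sample(rows, q, rng):
    """Keep each row independently with probability q (Bernoulli sampling)."""
    return [r for r in rows if rng.random() < q]

def fk_join(orders, customers):
    """Join orders to customers on the foreign key orders.cust_id."""
    by_id = {c["id"]: c for c in customers}
    return [(o, by_id[o["cust_id"]]) for o in orders if o["cust_id"] in by_id]

rng = random.Random(0)  # fixed seed so the run is reproducible

# Hypothetical schema: every order references exactly one customer.
customers = [{"id": i} for i in range(1000)]
orders = [{"id": j, "cust_id": rng.randrange(1000)} for j in range(10000)]
q = 0.1  # assumed sampling rate

# Table-level sampling: each table is sampled on its own.
s_orders = bernoulli_sample(orders, q, rng)
s_customers = bernoulli_sample(customers, q, rng)
independent_join = fk_join(s_orders, s_customers)

# Foreign-key-preserving sampling: sample orders, then keep every
# customer that a sampled order references.
needed = {o["cust_id"] for o in s_orders}
linked_customers = [c for c in customers if c["id"] in needed]
linked_join = fk_join(s_orders, linked_customers)

print("sampled orders:          ", len(s_orders))
print("join of independent samples:", len(independent_join))
print("join with FK integrity:  ", len(linked_join))
```

With independent samples, a sampled order finds its customer only with probability `q`, so the expected join size is roughly `q * q * len(orders)`. When foreign-key integrity is preserved, every sampled order finds its match and the join has exactly `len(s_orders)` rows.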

Rainer Gemulla, Philipp Rösch, Wolfgang Lehner


Bernoulli Synopses | Database | Random Sampling | Sampling Techniques | SSDBM 2008


Added	01 Jun 2010
Updated	01 Jun 2010
Type	Conference
Year	2008
Where	SSDBM
Authors	Rainer Gemulla, Philipp Rösch, Wolfgang Lehner
