Sciweavers

SIGMOD
2010
ACM

Unbiased estimation of size and other aggregates over hidden web databases

14 years 15 days ago
Unbiased estimation of size and other aggregates over hidden web databases
Many websites provide restrictive form-like interfaces which allow users to execute search queries on the underlying hidden databases. In this paper, we consider the problem of estimating the size of a hidden database through its web interface. We propose novel techniques which use a small number of queries to produce unbiased estimates with small variance. These techniques can also be used for approximate query processing over hidden databases. We present theoretical analysis and extensive experiments to illustrate the effectiveness of our approach. Categories and Subject Descriptors H.2.7 [Database Administration]; H.3.5 [Online Information Services]: Web-based services General Terms Algorithms, Measurement, Performance Keywords Hidden Databases, Aggregate Query Processing
Arjun Dasgupta, Xin Jin, Bradley Jewell, Nan Zhang
Added 06 Dec 2010
Updated 06 Dec 2010
Type Conference
Year 2010
Where SIGMOD
Authors Arjun Dasgupta, Xin Jin, Bradley Jewell, Nan Zhang 0004, Gautam Das
Comments (0)