VLDB
1997
ACM

A One-Pass Algorithm for Accurately Estimating Quantiles for Disk-Resident Data

The φ-quantile of an ordered sequence of data values is the element with rank φn, where n is the total number of values. Accurate estimates of quantiles are required for the solution of many practical problems. In this paper, we present a new algorithm for estimating the quantile values for disk-resident data. Our algorithm has the following characteristics: (1) It requires only one pass over the data; (2) It is deterministic; (3) It produces good lower and upper bounds of the true values of the quantiles; (4) It requires no a priori knowledge of the distribution of the data set; (5) It has a scalable parallel formulation; (6) Extra time and memory for computing additional quantiles (beyond the first one) are constant per quantile. We present experimental results on the IBM SP-2. The experimental results show that the algorithm is indeed robust and does not depend on the distribution of the data sets.
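The abstract does not spell out the procedure, so the following is only a minimal sketch of one way to obtain deterministic lower and upper bounds on a quantile in a single pass. It is not the authors' exact algorithm. It assumes the data arrives in equal-sized runs that each fit in memory; the names `sample_run`, `quantile_bounds`, `run_size`, `s`, and `phis` are hypothetical. Each run is sorted in memory and reduced to `s` regularly spaced samples, the per-run sample lists are merged, and two positions in the merged list are chosen so that the true quantile is guaranteed to lie between them.

```python
import heapq
import math


def sample_run(run, s):
    """Sort one in-memory run and keep its s regularly spaced elements."""
    step = len(run) // s
    ordered = sorted(run)
    # 1-indexed ranks step, 2*step, ..., s*step; the last is the run maximum.
    return [ordered[k * step - 1] for k in range(1, s + 1)]


def quantile_bounds(runs, run_size, s, phis):
    """Return {phi: (lower, upper)} bracketing the element of rank ceil(phi * n).

    Assumption of this sketch: every run holds exactly run_size values
    and run_size is a multiple of s.
    """
    if run_size % s != 0:
        raise ValueError("run_size must be a multiple of s")
    step = run_size // s
    sample_lists = []
    smallest = None
    for run in runs:  # the only pass over the data
        if len(run) != run_size:
            raise ValueError("every run must contain run_size values")
        sample_lists.append(sample_run(run, s))
        run_min = min(run)
        smallest = run_min if smallest is None else min(smallest, run_min)

    r = len(sample_lists)
    n = r * run_size
    merged = list(heapq.merge(*sample_lists))  # r * s samples, sorted

    bounds = {}
    for phi in phis:
        t = min(n, max(1, math.ceil(phi * n)))  # target rank, 1-indexed
        # The j-th merged sample has at least j * step values <= it,
        # so the first j with j * step >= t is an upper bound.
        j_hi = (t + step - 1) // step
        # The j-th merged sample has at most (j - 1) * step + r * (step - 1)
        # values strictly below it; keep that count <= t - 1 for a lower bound.
        j_lo = (t - 1 - r * (step - 1)) // step + 1
        lower = merged[j_lo - 1] if j_lo >= 1 else smallest
        bounds[phi] = (lower, merged[j_hi - 1])
    return bounds
```

In this sketch the retained state is the `r * s` samples plus the running minimum, and each additional quantile costs only two index computations on the merged list. Increasing `s` tightens the bounds at the cost of more memory; with `s == run_size` every value is kept and the bounds coincide with the exact quantile.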
Khaled Alsabti, Sanjay Ranka, Vineet Singh
Added 08 Aug 2010
Updated 08 Aug 2010
Type Conference
Year 1997
Where VLDB