Sampling cube: a framework for statistical olap over sampling data

16 years 6 months ago

Download www.xiaolei.org

Sampling is a popular method of data collection when it is impossible or too costly to reach the entire population. For example, television show ratings in the United States are gathered from a sample of roughly 5,000 households. To use the results effectively, the samples are further partitioned in a multidimensional space based on multiple attribute values. This naturally leads to the desirability of OLAP (Online Analytical Processing) over sampling data. However, unlike traditional data, sampling data is inherently uncertain, i.e., not representing the full data in the population. Thus, it is desirable to return not only query results but also the confidence intervals indicating the reliability of the results. Moreover, a certain segment in a multidimensional space may contain none or too few samples. This requires some additional analysis to return trustable results. In this paper we propose a Sampling Cube framework, which efficiently calculates confidence intervals for any multi...

Xiaolei Li, Jiawei Han, Zhijun Yin, Jae-Gil Lee, Y

Real-time Traffic

Database | Multidimensional Query | Multidimensional Space | Sampling Cube Shell | SIGMOD 2008 |

claim paper

» General Framework on Change Detection in a Sparse Domain

» Probabilistic MultiShape Representation Using an Isometric LogRatio Mapping

» WaveletBased Histograms for Selectivity Estimation

» A quantitative framework for automated preexecution thread selection

» Conditional Likelihood Maximisation A Unifying Framework for Information Theoretic Feature...

» Bayesian spatial modeling and interpolation using copulas

» Semisupervised discovery of differential genes

» The use of online cotraining to reduce the training set size in pattern recognition method...

Post Info
More Details (n/a)

Added	08 Dec 2009
Updated	08 Dec 2009
Type	Conference
Year	2008
Where	SIGMOD
Authors	Xiaolei Li, Jiawei Han, Zhijun Yin, Jae-Gil Lee, Yizhou Sun

Comments (0)

Sciweavers

Sampling cube: a framework for statistical olap over sampling data

Database | Multidimensional Query | Multidimensional Space | Sampling Cube Shell | SIGMOD 2008 |

Explore & Download

Productivity Tools

Sciweavers