Query-Aware Sampling for Data Streams

16 years 1 months ago

Download ccs.njit.edu

Data Stream Management Systems are useful when large volumes of data need to be processed in real time. Examples include monitoring network traffic, monitoring financial transactions, and analyzing large scale scientific data feeds. These applications have varying data rates and often show bursts of high activity that overload the system, often during the most critical instants (e.g., network attacks, financial spikes) for analysis. Therefore, load shedding is necessary to preserve the stability of the system, gracefully degrade its performance and extract answers. Existing methods for load shedding in a general purpose data stream query system use random sampling of tuples, essentially independent of the query. While this technique is acceptable for some queries, the results may be meaningless or even incorrect for other queries. In principle, a number of different query-dependent sampling methods exist, but they work only for particular queries. In this paper, we show how to perform...

Theodore Johnson, S. Muthukrishnan, Vladislav Shka

Real-time Traffic

Data Stream | Database | ICDE 2007 | Load Shedding | Sampling Methods |

claim paper

» Structureaware sampling on data streams

» ReservoirBased Random Sampling with Replacement from Data Stream

» Continuous sampling from distributed streams

» Exploring Early Classification Strategies of Streaming Data with Delayed Attributes

» Reliable Transmission of Audio Streams in Lossy Channels Using Application Level Data Hidi...

» Innovation Rate Sampling of Pulse Streams With Application to Ultrasound Imaging

» Testing and SpotChecking of Data Streams

» Low Rate Sampling of Pulse Streams with Application to Ultrasound Imaging

Post Info
More Details (n/a)

Added	03 Jun 2010
Updated	03 Jun 2010
Type	Conference
Year	2007
Where	ICDE
Authors	Theodore Johnson, S. Muthukrishnan, Vladislav Shkapenyuk, Oliver Spatscheck

Comments (0)

Sciweavers

Query-Aware Sampling for Data Streams

Data Stream | Database | ICDE 2007 | Load Shedding | Sampling Methods |

Explore & Download

Productivity Tools

Sciweavers