Duplicate-Insensitive Order Statistics Computation over Data Streams

15 years 5 months ago

Download www.cse.unsw.edu.au

—Duplicates in data streams may often be observed by the projection on a subspace and/or multiple recordings of objects. Without the uniqueness assumption on observed data elements, many conventional aggregates computation problems need to be further investigated due to their duplication sensitive nature. In this paper, we present novel, space-efﬁcient, one-scan algorithms to continuously maintain duplicate insensitive order sketches so that rank-based queries can be approximately processed with a relative rank error guarantee ǫ in the presence of data duplicates. Besides the space efﬁciency, the proposed algorithms are time-efﬁcient and highly accurate. Moreover, our techniques may be immediately applied to the heavy hitter problem against distinct elements and to the existing fault-tolerant distributed communication techniques. A comprehensive performance study demonstrates that our algorithms can support real-time computation against high speed data streams.

Ying Zhang, Xuemin Lin, Yidong Yuan, Masaru Kitsur

Real-time Traffic

Conventional Aggregates Computation | Data Streams | Duplication Sensitive Nature | TKDE 2010 |

claim paper

» Summarizing Order Statistics over Data Streams with Duplicates

» Continuously Maintaining Order Statistics over Data Streams

» Range counting over multidimensional data streams

» Memoryconstrained aggregate computation over data streams

» Spaceefficient Relative Error Order Sketch over Data Streams

» Processing complex aggregate queries over data streams

» Optimizing InOrder Execution of Continuous Queries over Streamed Sensor Data

» Continuous Monitoring of Distributed Data Streams over a Timebased Sliding Window

Post Info
More Details (n/a)

Added	31 Jan 2011
Updated	31 Jan 2011
Type	Journal
Year	2010
Where	TKDE
Authors	Ying Zhang, Xuemin Lin, Yidong Yuan, Masaru Kitsuregawa, Xiaofang Zhou, Jeffrey Xu Yu

Comments (0)

Sciweavers

Duplicate-Insensitive Order Statistics Computation over Data Streams

Conventional Aggregates Computation | Data Streams | Duplication Sensitive Nature | TKDE 2010 |

Explore & Download

Productivity Tools

Sciweavers