iJoin: Importance-Aware Join Approximation over Data Streams

14 years 6 months ago

Download people.bu.edu

We consider approximate join processing over data streams when memory limitations cause incoming tuples to overﬂow the available space, precluding exact processing. Selective eviction of tuples (loadshedding) is needed, but is challenging since data distributions and arrival rates are unknown a priori. Also, in many real-world applications such as for the stock market and sensor-data, diﬀerent items may have diﬀerent importance levels. Current methods pay little attention to load-shedding when tuples bear such importance semantics, and perform poorly due to premature tuple drops and unproductive tuple retention. We propose a novel framework, called iJoin, which overcomes these drawbacks, and also provides tuples a fair chance in being part of the join result. Our load-shedding scheme for iJoin maximizes the total importance of join results, and allows reconﬁguration of tuple-importance. We also show how to trade oﬀ load-shedding overhead and approximation-error. Our experimen...

Dhananjay Kulkarni, Chinya V. Ravishankar

Real-time Traffic

Approximate Join Processing | Database | Diﬀerent Importance Levels | SSDBM 2008 | Tuples |

claim paper

Post Info
More Details (n/a)

Added	01 Jun 2010
Updated	01 Jun 2010
Type	Conference
Year	2008
Where	SSDBM
Authors	Dhananjay Kulkarni, Chinya V. Ravishankar

Comments (0)

Sciweavers

iJoin: Importance-Aware Join Approximation over Data Streams

Approximate Join Processing | Database | Diﬀerent Importance Levels | SSDBM 2008 | Tuples |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers