A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins



Recent work on parallel joins and data skew has concentrated on algorithm design without considering the causes and characteristics of data skew itself. Existing analytic models of skew do not contain enough information to fully describe data skew in parallel implementations. Because the assumptions made about the nature of skew vary between authors, it is almost impossible to make valid comparisons of parallel algorithms. In this paper, a taxonomy of skew effects is developed, and a new performance model is introduced. The model is used to compare the performance of two parallel join algorithms.

Christopher B. Walton, Alfred G. Dale, Roy M. Jenevein


Data Skew | Database | Parallel Join | Skew Effects | VLDB 1991


» Parallel Pointer-Based Join Algorithms in Memory-mapped Environments

» Processing theta-joins using MapReduce

» Skew-aware automatic database partitioning in shared-nothing, parallel OLTP systems

» Executing Stream Joins on the Cell Processor

Post Info

Added	27 Aug 2010
Updated	27 Aug 2010
Type	Conference
Year	1991
Where	VLDB
Authors	Christopher B. Walton, Alfred G. Dale, Roy M. Jenevein
