Sciweavers

51 search results - page 3 / 11
» Handling Data Skew in MapReduce
Sort
View
ICDE
2010
IEEE
379views Database» more  ICDE 2010»
14 years 8 months ago
Osprey: Implementing MapReduce-Style Fault Tolerance in a Shared-Nothing Distributed Database
In this paper, we describe a scheme for tolerating and recovering from mid-query faults in a distributed shared nothing database. Rather than aborting and restarting queries, our s...
Christopher Yang, Christine Yen, Ceryen Tan, Samue...
SIGMOD
2012
ACM
226views Database» more  SIGMOD 2012»
11 years 11 months ago
SkewTune: mitigating skew in mapreduce applications
We present an automatic skew mitigation approach for userdefined MapReduce programs and present SkewTune, a system that implements this approach as a drop-in replacement for an e...
YongChul Kwon, Magdalena Balazinska, Bill Howe, Je...
SIGMOD
2002
ACM
84views Database» more  SIGMOD 2002»
14 years 8 months ago
Skew handling techniques in sort-merge join
Joins are among the most frequently executed operations. Several fast join algorithms have been developed and extensively studied; these can be categorized as sort-merge, hash-bas...
Wei Li, Dengfeng Gao, Richard T. Snodgrass
VLDB
1992
ACM
140views Database» more  VLDB 1992»
14 years 18 days ago
Practical Skew Handling in Parallel Joins
We present an approach to dealing with skew in parallel joins in database systems. Our approach is easily implementable within current parallel DBMS, and performs well on skewed d...
David J. DeWitt, Jeffrey F. Naughton, Donovan A. S...
EUROSYS
2011
ACM
13 years 1 days ago
Scarlett: coping with skewed content popularity in mapreduce clusters
To improve data availability and resilience MapReduce frameworks use file systems that replicate data uniformly. However, analysis of job logs from a large production cluster show...
Ganesh Ananthanarayanan, Sameer Agarwal, Srikanth ...