ApproxHadoop: Bringing Approximations to MapReduce Frameworks

9 years 10 months ago

Download www.cs.rutgers.edu

We propose and evaluate a framework for creating and running approximation-enabled MapReduce programs. Specifically, we propose approximation mechanisms that ﬁt naturally into the MapReduce paradigm, including input data sampling, task dropping, and accepting and running a precise and a user-deﬁned approximate version of the MapReduce code. We then show how to leverage statistical theories to compute error bounds for popular classes of MapReduce programs when approximating with input data sampling and/or task dropping. We implement the proposed mechanisms and error bound estimations in a prototype system called ApproxHadoop. Our evaluation uses MapReduce applications from different domains, including data analytics, scientiﬁc computing, video encoding, and machine learning. Our results show that ApproxHadoop can signiﬁcantly reduce application execution time and/or energy consumption when the user is willing to tolerate small errors. For example, ApproxHadoop can reduce runtim...

Iñigo Goiri, Ricardo Bianchini, Santosh Nag

Real-time Traffic

ASPLOS 2015 | Programming Languages |

claim paper

Post Info
More Details (n/a)

Added	16 Apr 2016
Updated	16 Apr 2016
Type	Journal
Year	2015
Where	ASPLOS
Authors	Iñigo Goiri, Ricardo Bianchini, Santosh Nagarakatte, Thu D. Nguyen

Comments (0)

Sciweavers

ApproxHadoop: Bringing Approximations to MapReduce Frameworks

ASPLOS 2015 | Programming Languages |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers