USENIX
2008

Automatic Optimization of Parallel Dataflow Programs

Large-scale parallel dataflow systems, e.g., Dryad and Map-Reduce, have attracted significant attention recently. High-level dataflow languages such as Pig Latin and Sawzall are being layered on top of these systems, to enable faster program development and more maintainable code. These languages engender greater transparency in program structure, and open up opportunities for automatic optimization. This paper proposes a set of optimization strategies for this context, drawing on and extending techniques from the database community.
Christopher Olston, Benjamin Reed, Adam Silberstein, Utkarsh Srivastava
Added: 02 Oct 2010
Updated: 02 Oct 2010
Type: Conference
Year: 2008
Where: USENIX
Authors: Christopher Olston, Benjamin Reed, Adam Silberstein, Utkarsh Srivastava