Sciweavers

NOMS
2010
IEEE

Performance-driven task co-scheduling for MapReduce environments

13 years 11 months ago
Performance-driven task co-scheduling for MapReduce environments
—MapReduce is a data-driven programming model proposed by Google in 2004 which is especially well suited for distributed data analytics applications. We consider the management of MapReduce applications in an environment where multiple applications share the same physical resources. Such sharing is in line with recent trends in data center management which aim to consolidate workloads in order to achieve cost and energy savings. In a shared environment, it is necessary to predict and manage the performance of workloads given a set of performance goals defined for them. In this paper, we address this problem by introducing a new task scheduler for a MapReduce framework that allows performance-driven management of MapReduce tasks. The proposed task scheduler dynamically predicts the performance of concurrent MapReduce jobs and adjusts the resource allocation for the jobs. It allows applications to meet their performance objectives without overprovisioning of physical resources.
Jorda Polo, David Carrera, Yolanda Becerra, Malgor
Added 29 Jan 2011
Updated 29 Jan 2011
Type Journal
Year 2010
Where NOMS
Authors Jorda Polo, David Carrera, Yolanda Becerra, Malgorzata Steinder, Ian Whalley
Comments (0)