Search Sciweavers | Sciweavers

32 search results - page 1 / 7

JSSPP
2004
Springer

Performance Implications of Failures in Large-Scale Cluster Scheduling

As we continue to evolve into large-scale parallel systems, many of them employing hundreds of computing engines to take on mission-critical roles, it is crucial to design those s...

Yanyong Zhang, Mark S. Squillante, Anand Sivasubra...

IPPS
2005
IEEE

Performance Implications of Periodic Checkpointing on Large-Scale Cluster Systems

Large-scale systems like BlueGene/L are susceptible to a number of software and hardware failures that can affect system performance. Periodic application checkpointing is a commo...

Adam J. Oliner, Ramendra K. Sahoo, José E. ...

CCGRID
2006
IEEE

A Failure-Aware Scheduling Strategy in Large-Scale Cluster System

As the scale is expanding, node failure becomes a commonplace feature of large-scale cluster systems. As an important part of cluster operating system software, job scheduling tak...

Linping Wu, Dan Meng, Jianfeng Zhan, Wang Lei, Bib...

ESCIENCE
2006
IEEE

Job Failure Analysis and Its Implications in a Large-Scale Production Grid

In this paper we present an initial analysis of job failures in a large-scale data-intensive Grid. Based on three representative periods in production, we characterize the interar...

Hui Li, David L. Groep, Lex Wolters, Jeffrey Templ...

ECRTS
2007
IEEE

A Hybrid Real-Time Scheduling Approach for Large-Scale Multicore Platforms

We propose a hybrid approach for scheduling real-time tasks on large-scale multicore platforms with hierarchical shared caches. In this approach, a multicore platform is partition...

John M. Calandrino, James H. Anderson, Dan P. Baum...
