Search Sciweavers | Sciweavers

CCGRID
2006
IEEE

A Failure-Aware Scheduling Strategy in Large-Scale Cluster System

16 years 23 days ago

As the scale is expanding, node failure becomes a commonplace feature of large-scale cluster systems. As an important part of cluster operating system software, job scheduling tak...

Linping Wu, Dan Meng, Jianfeng Zhan, Wang Lei, Bib...

CF
2009
ACM

High accuracy failure injection in parallel and distributed systems using virtualization

15 years 4 months ago

Emulation sits between simulation and experimentation to complete the set of tools available for software designers to evaluate their software and predict behavior under condition...

Thomas Hérault, Thomas Largillier, Sylvain ...

KDD
2005
ACM

Failure detection and localization in component based systems by online tracking

16 years 6 days ago

The increasing complexity of today’s systems makes fast and accurate failure detection essential for their use in mission-critical applications. Various monitoring methods provi...

Haifeng Chen, Guofei Jiang, Cristian Ungureanu, Ke...

ICPPW
2008
IEEE

Simulating Failures on Large-Scale Systems

16 years 1 months ago

Developing fault management mechanisms is a difficult task because of the unpredictable nature of failures. In this paper, we present a fault simulation framework for Blue Gene...

Narayan Desai, Ewing L. Lusk, Daniel Buettner, And...

SBACPAD
2005
IEEE

VRM: A Failure-Aware Grid Resource Management System

16 years 8 days ago

For resource management in Grid environments, advance reservations turned out to be very useful and hence are supported by a variety of Grid toolkits. However, failure ...

Lars-Olof Burchard, César A. F. De Rose, Ha...
