Search Sciweavers | Sciweavers

106 search results - page 11 / 22

» Transparent Fault Tolerance for Grid Applications

220

click to vote

ICS
2007
Tsinghua U.

167views Distributed And Parallel Com...» more ICS 2007»

Proactive fault tolerance for HPC with Xen virtualization

16 years 1 months ago

Download www.csm.ornl.gov

Large-scale parallel computing is relying increasingly on clusters with thousands of processors. At such large counts of compute nodes, faults are becoming common place. Current t...

Arun Babu Nagarajan, Frank Mueller, Christian Enge...

claim paper

Read More »

199

click to vote

EUROSYS
2011
ACM

218views Software Engineering» more EUROSYS 2011»

Refuse to crash with Re-FUSE

14 years 11 months ago

Download pages.cs.wisc.edu

We introduce Re-FUSE, a framework that provides support for restartable user-level ﬁle systems. Re-FUSE monitors the user-level ﬁle-system and on a crash transparently restart...

Swaminathan Sundararaman, Laxman Visampalli, Andre...

claim paper

Read More »

337

click to vote

CCGRID
2008
IEEE

191views Distributed And Parallel Com...» more CCGRID 2008»

An Autonomic Workflow Management System for Global Grids

16 years 1 months ago

Download www.gridbus.org

Workflow Management System is generally utilized to define, manage and execute workflow applications on Grid resources. However, the increasing scale complexity, heterogeneity and...

Mustafizur Rahman 0003, Rajkumar Buyya

claim paper

Read More »

226

click to vote

LCPC
2009
Springer

173views System Software» more LCPC 2009»

A Communication Framework for Fault-Tolerant Parallel Execution

16 years 5 hour ago

Download www2.cs.uh.edu

PC grids represent massive computation capacity at a low cost, but are challenging to employ for parallel computing because of variable and unpredictable performance and availabili...

Nagarajan Kanna, Jaspal Subhlok, Edgar Gabriel, Es...

claim paper

Read More »

189

click to vote

ESCIENCE
2006
IEEE

133views Distributed And Parallel Com...» more ESCIENCE 2006»

Practical Fault-Tolerant Framework for eScience Infrastructure

16 years 1 months ago

Download dcslab.snu.ac.kr

Many areas of science currently use computing resources as a important part of their research, and many research groups adopt cluster architecture to use them eﬃciently and mana...

Hyuck Han, Jai Wug Kim, Jongpil Lee, Youngjin Yu, ...

claim paper

Read More »

« Prev « First page 11 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers