Search Sciweavers | Sciweavers

1119 search results - page 15 / 224

» Computing in the Presence of Timing Failures

click to vote

PODC
2012
ACM

251views Distributed and Parallel Com...» more PODC 2012»

Asynchronous failure detectors

11 years 10 months ago

Download srikanth.sastry.name

Failure detectors — oracles that provide information about process crashes — are an important ion for crash tolerance in distributed systems. Although current failure-detector...

Alejandro Cornejo, Nancy A. Lynch, Srikanth Sastry

claim paper

Read More »

click to vote

INFOCOM
2010
IEEE

154views Communications» more INFOCOM 2010»

Network Coding Tomography for Network Failures

13 years 6 months ago

Download personal.ie.cuhk.edu.hk

—Network Tomography (or network monitoring) uses end-to-end path-level measurements to characterize the network, such as topology estimation and failure detection. This work prov...

Hongyi Yao, Sidharth Jaggi, Minghua Chen

claim paper

Read More »

click to vote

ICPP
2007
IEEE

123views Distributed And Parallel Com...» more ICPP 2007»

A Meta-Learning Failure Predictor for Blue Gene/L Systems

14 years 1 months ago

Download www.mcs.anl.gov

The demand for more computational power in science and engineering has spurred the design and deployment of ever-growing cluster systems. Even though the individual components use...

Prashasta Gujrati, Yawei Li, Zhiling Lan, Rajeev T...

claim paper

Read More »

click to vote

CCGRID
2010
IEEE

144views Distributed And Parallel Com...» more CCGRID 2010»

Selective Recovery from Failures in a Task Parallel Programming Model

13 years 8 months ago

Download www.cse.ohio-state.edu

Abstract--We present a fault tolerant task pool execution environment that is capable of performing fine-grain selective restart using a lightweight, distributed task completion tr...

James Dinan, Arjun Singri, P. Sadayappan, Sriram K...

claim paper

Read More »

click to vote

ICPPW
2009
IEEE

183views Distributed And Parallel Com...» more ICPPW 2009»

Decentralized Load Balancing for Improving Reliability in Heterogeneous Distributed Systems

14 years 2 months ago

Download www.eece.unm.edu

Abstract—A probabilistic analytical framework for decentralized load balancing (LB) strategies for heterogeneous distributed-computing systems (DCSs) is presented with the overal...

Jorge E. Pezoa, Sagar Dhakal, Majeed M. Hayat

claim paper

Read More »

« Prev « First page 15 / 224 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers