Search Sciweavers | Sciweavers

46 search results - page 1 / 10

» Rebound: scalable checkpointing for coherent shared memory

229

Voted

ISCA
2011
IEEE

238views Hardware» more ISCA 2011»

Rebound: scalable checkpointing for coherent shared memory

14 years 11 months ago

Download iacoma.cs.uiuc.edu

As we move to large manycores, the hardware-based global checkpointing schemes that have been proposed for small shared-memory machines do not scale. Scalability barriers include ...

Rishi Agarwal, Pranav Garg, Josep Torrellas

claim paper

Read More »

245

Voted

SC
2000
ACM

110views Applied Computing» more SC 2000»

Scalable Fault-Tolerant Distributed Shared Memory

15 years 11 months ago

Download www.sc2000.org

This paper shows how a state-of-the-art software distributed shared-memory (DSM) protocol can be eﬃciently extended to tolerate single-node failures. In particular, we extend a ...

Florin Sultan, Thu D. Nguyen, Liviu Iftode

claim paper

Read More »

224

click to vote

ISCA
2002
IEEE

115views Hardware» more ISCA 2002»

SafetyNet: Improving the Availability of Shared Memory Multiprocessors with Global Checkpoint/Recovery

16 years 9 days ago

Download www.cs.wisc.edu

We develop an availability solution, called SafetyNet, that uses a uniﬁed, lightweight checkpoint/recovery mechanism to support multiple long-latency fault detection schemes. At...

Daniel J. Sorin, Milo M. K. Martin, Mark D. Hill, ...

claim paper

Read More »

175

click to vote

PPAM
2005
Springer

137views Distributed And Parallel Com...» more PPAM 2005»

Checkpointing Speculative Distributed Shared Memory

16 years 26 days ago

Download www.cs.put.poznan.pl

This paper describes a checkpointing mechanism destined for Distributed Shared Memory (DSM) systems with speculative prefetching. Speculation is a general technique involving predi...

Arkadiusz Danilecki, Anna Kobusinska, Michal Szych...

claim paper

Read More »

225

Voted

SRDS
1994
IEEE

120views Operating System» more SRDS 1994»

Coordinated Checkpointing-Rollback Error Recovery for Distributed Shared Memory Multicomputers

15 years 11 months ago

Download fmdb.cs.ucla.edu

Most recovery schemes that have been proposed for Distributed Shared Memory (DSM) systems require unnecessarily high checkpointing frequency and checkpoint traffic, which are sens...

G. Janakiraman, Yuval Tamir

claim paper

Read More »

« Prev « First page 1 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers