Search Sciweavers | Sciweavers

114 search results - page 16 / 23

» Speculative Parallelization - Eliminating the Overhead of Fa...

239

click to vote

HCW
2000
IEEE

170views Distributed And Parallel Com...» more HCW 2000»

Evaluation of PAMS' Adaptive Management Services

15 years 10 months ago

Download acl.ece.arizona.edu

Management of large-scale parallel and distributed applications is an extremely complex task due to factors such as centralized management architectures, lack of coordination and ...

Yoonhee Kim, Salim Hariri, Muhamad Djunaedi

claim paper

Read More »

145

click to vote

ICDCS
2000
IEEE

146views Distributed And Parallel Com...» more ICDCS 2000»

Coherence-based Coordinated Checkpointing for Software Distributed Shared Memory Systems

15 years 10 months ago

Download www.cacs.louisiana.edu

Fault-tolerant techniques that can cope with system failures in software distributed shared memory (SDSM) are essential for creating productive and highly available parallel compu...

Angkul Kongmunvattana, Santipong Tanchatchawal, Ni...

claim paper

Read More »

218

click to vote

ICDCS
2012
IEEE

238views Distributed And Parallel Com...» more ICDCS 2012»

Combining Partial Redundancy and Checkpointing for HPC

13 years 8 months ago

Download moss.csc.ncsu.edu

Today’s largest High Performance Computing (HPC) systems exceed one Petaﬂops (1015 ﬂoating point operations per second) and exascale systems are projected within seven years...

James Elliott, Kishor Kharbas, David Fiala, Frank ...

claim paper

Read More »

262

click to vote

PPOPP
2003
ACM

140views Distributed And Parallel Com...» more PPOPP 2003»

Automated application-level checkpointing of MPI programs

15 years 11 months ago

Download iss.ices.utexas.edu

Because of increasing hardware and software complexity, the running time of many computational science applications is now more than the mean-time-to-failure of highpeformance com...

Greg Bronevetsky, Daniel Marques, Keshav Pingali, ...

claim paper

Read More »

156

click to vote

PODC
1990
ACM

134views Distributed and Parallel Com...» more PODC 1990»

Sharing Memory Robustly in Message-Passing Systems

15 years 10 months ago

Download www.cs.huji.ac.il

Emulators that translate algorithms from the shared-memory model to two different message-passing models are presented. Both are achieved by implementing a wait-free, atomic, singl...

Hagit Attiya, Amotz Bar-Noy, Danny Dolev

claim paper

Read More »

« Prev « First page 16 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers