Sciweavers

647 search results - page 30 / 130
» Simulating Failures on Large-Scale Systems
ICPP
2009
IEEE
14 years 2 months ago
Accelerating Checkpoint Operation by Node-Level Write Aggregation on Multicore Systems
Clusters and applications continue to grow in size while their mean time between failures (MTBF) is getting smaller. Checkpoint/Restart is becoming increasingly important for lar...
Xiangyong Ouyang, Karthik Gopalakrishnan, Dhabales...
MR
2006
108 views, Robotics
13 years 8 months ago
Electronic circuit reliability modeling
The intrinsic failure mechanisms and reliability models of state-of-the-art MOSFETs are reviewed. The simulation tools and failure equivalent circuits are described. The review in...
Joseph B. Bernstein, Moshe Gurfinkel, Xiaojun Li, ...
SRDS
2003
IEEE
14 years 1 months ago
Sharing Memory with Semi-Byzantine Clients and Faulty Storage Servers
This paper presents fault-tolerant simulations of a single-writer multi-reader regular register in storage systems. One simulation tolerates fail-stop failures of storage servers ...
Hagit Attiya, Amir Bar-Or
CN
2007
224 views
13 years 8 months ago
Automated adaptive intrusion containment in systems of interacting services
Large scale distributed systems typically have interactions among different services that create an avenue for propagation of a failure from one service to another. The failures ...
Yu-Sung Wu, Bingrui Foo, Yu-Chun Mao, Saurabh Bagc...
WSC
2008
13 years 10 months ago
Partial-modular DEVS for improving performance of cellular space wildfire spread simulation
Simulation of wildfire spread remains a challenging task. In previous work, a cellular space fire spread simulation model has been developed based on the Discrete Event Syst...
Yi Sun, Xiaolin Hu