Sciweavers

647 search results - page 30 / 130
» Simulating Failures on Large-Scale Systems
Sort
View
140
Voted
ICPP
2009
IEEE
15 years 9 months ago
Accelerating Checkpoint Operation by Node-Level Write Aggregation on Multicore Systems
—Clusters and applications continue to grow in size while their mean time between failure (MTBF) is getting smaller. Checkpoint/Restart is becoming increasingly important for lar...
Xiangyong Ouyang, Karthik Gopalakrishnan, Dhabales...
104
Voted
MR
2006
108views Robotics» more  MR 2006»
15 years 2 months ago
Electronic circuit reliability modeling
The intrinsic failure mechanisms and reliability models of state-of-the-art MOSFETs are reviewed. The simulation tools and failure equivalent circuits are described. The review in...
Joseph B. Bernstein, Moshe Gurfinkel, Xiaojun Li, ...
109
Voted
SRDS
2003
IEEE
15 years 7 months ago
Sharing Memory with Semi-Byzantine Clients and Faulty Storage Servers
This paper presents fault-tolerant simulations of a single-writer multi-reader regular register in storage systems. One simulation tolerates fail-stop failures of storage servers ...
Hagit Attiya, Amir Bar-Or
CN
2007
224views more  CN 2007»
15 years 2 months ago
Automated adaptive intrusion containment in systems of interacting services
Large scale distributed systems typically have interactions among different services that create an avenue for propagation of a failure from one service to another. The failures ...
Yu-Sung Wu, Bingrui Foo, Yu-Chun Mao, Saurabh Bagc...
WSC
2008
15 years 4 months ago
Partial-modular DEVS for improving performance of cellular space wildfire spread simulation
Simulation of wildfire spread remains to be a challenging task. In previous work, a cellular space fire spread simulation model has been developed based on the Discrete Event Syst...
Yi Sun, Xiaolin Hu