Sciweavers

» Systems Failures
DATE
2010
IEEE
14 years 2 months ago
GentleCool: Cooling aware proactive workload scheduling in multi-machine systems
In state-of-the-art systems, workload scheduling and server fan speed operate independently, leading to cooling inefficiencies. In this work we propose GentleCool, a proactive m...
Raid Ayoub, Shervin Sharifi, Tajana Simunic Rosing
FTDCS
1997
IEEE
14 years 1 months ago
Toward globally optimal resource management in large-scale real-time distributed computer systems
This paper discusses the issues and promising approaches in (1) obtaining rigorous specifications of the quality-of-service (QoS) requirements associated with application functio...
K. H. Kim
ICCS
2007
Springer
14 years 1 months ago
A Dataflow-Oriented Atomicity and Provenance System for Pipelined Scientific Workflows
Scientific workflows have gained great momentum in recent years due to their critical roles in e-Science and cyberinfrastructure applications. However, some tasks of a scientific w...
Liqiang Wang, Shiyong Lu, Xubo Fei, Jeffrey L. Ram
ICECCS
1995
IEEE
14 years 23 days ago
Using speculative execution for fault tolerance in a real-time system
Achieving fault-tolerance using a primary-backup approach involves overhead of recovery such as activating the backup and propagating execution states, which may affect the timelin...
Mohamed F. Younis, Grace Tsai, Thomas J. Marlowe, ...
DSN
2003
IEEE
14 years 2 months ago
From Crash Tolerance to Authenticated Byzantine Tolerance: A Structured Approach, the Cost and Benefits
Many fault-tolerant group communication middleware systems have been implemented assuming crash failure semantics. While this assumption is not unreasonable, it becomes hard to ju...
Dimane Mpoeleng, Paul D. Ezhilchelvan, Neil A. Spe...