Modern distributed applications pose increasing demands for high availability, automatic management, and dynamic conguration of their software systems. This paper presents the ar...
The paper addresses the problem of the construction and management of highly available services in large, open distributed systems. A novel replication protocol is proposed to sat...
We propose a domain-specific aspect language to prevent denial of service caused by resource management. Our aspects specify availability policies by enforcing time limits in the ...
— One of the major challenges in managing resources of computational Grids with diverse shared resources is how to meet users' QoS requirements and rationally distribute res...
Most of today‘s HPC systems employ a single head node for control, which represents a single point of failure as it interrupts an entire HPC system upon failure. Furthermore, it...
Kai Uhlemann, Christian Engelmann, Stephen L. Scot...