Modern distributed applications pose increasing demands for high availability, automatic management, and dynamic conguration of their software systems. This paper presents the architecture of Sampa, a System for Availability Management of Process-based Applications, which aims at fullling these requirements. The system has been designed to support the management of faulttolerant DCE-based distributed programs according to user-provided and application-specic availability specications. It is supposed to detect and automatically react to faults such as node crashes, network partitions, process crashes, and hang-ups. In this paper, we focus on the design of some of its services { the monitoring, checkpointing, and conguration management facilities { and show how they can be used for managing a generic fault-tolerant service.