This paper addresses the construction of a fault-tolerant and responsive server subsystem in an application context where the subsystem is accessed through an asynchronous network by a large number of clients. The server is made fault-tolerant by the Triple Modular Redundancy (TMR) technique: at least two server processes behave correctly, while the third one can behave arbitrarily. An essential requirement for process replication is that the client inputs be delivered to the server replicas for processing in an identical order. Moreover, in order to bound each process's memory requirement, a time-bound constraint is imposed: no client input can stay in the local memory of a server process for more than Σ units of time. Based on known technologies, two assumptions are made: (1) the network delivers a given client input to any two server processes within a known bounded time (δ), and (2) there is an Ordered Timed Atomic Broadcast protocol built on top of the TMR system with timeliness Δ. The ...
Paul D. Ezhilchelvan, Jean-Michel Hélary, M
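
The TMR fault assumption stated in the abstract (at least two of the three server processes behave correctly, the third may behave arbitrarily) can be illustrated with a minimal majority-voting sketch. This is not the protocol of the paper: the function name `tmr_vote` and the idea of voting over the three replies are assumptions made purely for illustration.

```python
from collections import Counter
from typing import Hashable, Optional, Sequence


def tmr_vote(replies: Sequence[Hashable]) -> Optional[Hashable]:
    """Return the value reported by at least two of three replicas, else None.

    Hypothetical helper: it assumes one reply per replica has already
    been collected for the same client input.
    """
    if len(replies) != 3:
        raise ValueError("TMR expects exactly three replies")
    # most_common(1) yields the single most frequent reply and its count.
    value, count = Counter(replies).most_common(1)[0]
    return value if count >= 2 else None


# One replica answers arbitrarily; the two correct replicas still agree.
print(tmr_vote(["commit", "commit", "abort"]))
```

Because the two correct replicas process client inputs in an identical order, their replies to a given input coincide, so a single arbitrary reply cannot form a majority on its own.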