This paper describes an architecture and runtime system to implement distributed control and data processing applications in a thin-client manner, suitable for implementing a thin...
An important step in achieving robustness to run-time faults is the ability to detect and repair problems when they arise in a running system. Effective fault detection a...
Paulo Casanova, Bradley R. Schmerl, David Garlan, ...
Several computing environments including wide area networks and nondedicated networks of workstations are characterized by frequent unavailability of the participating ma...
This paper discusses ongoing work towards a theoretical basis intended to facilitate the development of self-regulating adaptive systems. Self-regulation refers to the capacity of...
Checkpointing is a commonly used approach to provide system fault-tolerance. However, using a constant checkpointing frequency may compromise the system's overall performance ...