Sciweavers

252 search results - page 5 / 51
» An Index-Based Checkpointing Algorithm for Autonomous Distri...
Sort
View
109
Voted
ICS
2011
Tsinghua U.
14 years 27 days ago
High performance linpack benchmark: a fault tolerant implementation without checkpointing
The probability that a failure will occur before the end of the computation increases as the number of processors used in a high performance computing application increases. For l...
Teresa Davies, Christer Karlsson, Hui Liu, Chong D...
57
Voted
SASO
2008
IEEE
15 years 3 months ago
Autonomic Request Management Algorithms for Geographically Distributed Internet-Based Systems
Supporting Web-based services through geographical distributed clusters of servers is a common solution to the increasing volume and variability of modern traffic. These architec...
Mauro Andreolini, Sara Casolari, Michele Colajanni
SC
2009
ACM
15 years 4 months ago
FALCON: a system for reliable checkpoint recovery in shared grid environments
In Fine-Grained Cycle Sharing (FGCS) systems, machine owners voluntarily share their unused CPU cycles with guest jobs, as long as the performance degradation is tolerable. For gu...
Tanzima Zerin Islam, Saurabh Bagchi, Rudolf Eigenm...
ICAC
2009
IEEE
14 years 7 months ago
Using distributed w-learning for multi-policy optimization in decentralized autonomic systems
Distributed W-Learning (DWL) is a reinforcement learningbased algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...
Ivana Dusparic, Vinny Cahill
104
Voted
SASO
2010
IEEE
14 years 7 months ago
Extracting Overlay Invariants of Distributed Systems for Autonomic System Management
Many large-scale distributed systems have been built with great complexity to run Internet services. Due to the heterogeneity and dynamics of complex systems, it is very difficult ...
Hanhuai Shan, Guofei Jiang, Kenji Yoshihira