Sciweavers

» Simulating Failures on Large-Scale Systems
ICPP
2008
IEEE
13 years 11 months ago
Dynamic Meta-Learning for Failure Prediction in Large-Scale Systems: A Case Study
Despite great efforts on the design of ultra-reliable components, the increase of system size and complexity has outpaced the improvement of component reliability. As a result, fa...
Jiexing Gu, Ziming Zheng, Zhiling Lan, John White,...
EUROPAR
2010
Springer
13 years 5 months ago
A Model for Space-Correlated Failures in Large-Scale Distributed Systems
Matthieu Gallet, Nezih Yigitbasi, Bahman Javadi, D...
P2P
2008
IEEE
13 years 11 months ago
Failure-Tolerant Overlay Trees for Large-Scale Dynamic Networks
Trees are fundamental structures for data dissemination in large-scale network scenarios. However, their inherent fragility has led researchers to rely on more redundant mesh topo...
Davide Frey, Amy L. Murphy
SRDS
1999
IEEE
13 years 9 months ago
Fault-Tolerant Replication Management in Large-Scale Distributed Storage Systems
Failures of all forms happen: from losing single network packets to site-wide disasters. Since businesses rely heavily on their data, it is imperative that failures require minima...
Richard A. Golding, Elizabeth Borowsky
ICDCS
2012
IEEE
11 years 7 months ago
Optimal Recovery from Large-Scale Failures in IP Networks
Quickly recovering IP networks from failures is critical to enhancing Internet robustness and availability. Due to their serious impact on network routing, large-scale failures ...
Qiang Zheng, Guohong Cao, Tom La Porta, Ananthram ...