Sciweavers

647 search results - page 3 / 130
» Simulating Failures on Large-Scale Systems
ICPP
2008
IEEE
14 years 7 days ago
Dynamic Meta-Learning for Failure Prediction in Large-Scale Systems: A Case Study
Despite great efforts on the design of ultra-reliable components, the increase of system size and complexity has outpaced the improvement of component reliability. As a result, fa...
Jiexing Gu, Ziming Zheng, Zhiling Lan, John White,...
EUROPAR
2010
Springer
13 years 6 months ago
A Model for Space-Correlated Failures in Large-Scale Distributed Systems
Matthieu Gallet, Nezih Yigitbasi, Bahman Javadi, D...
P2P
2008
IEEE
14 years 5 days ago
Failure-Tolerant Overlay Trees for Large-Scale Dynamic Networks
Trees are fundamental structures for data dissemination in large-scale network scenarios. However, their inherent fragility has led researchers to rely on more redundant mesh topo...
Davide Frey, Amy L. Murphy
SRDS
1999
IEEE
13 years 10 months ago
Fault-Tolerant Replication Management in Large-Scale Distributed Storage Systems
Failures of all forms happen: from losing single network packets to site-wide disasters. Since businesses rely heavily on their data, it is imperative that failures require minima...
Richard A. Golding, Elizabeth Borowsky
ICDCS
2012
IEEE
11 years 8 months ago
Optimal Recovery from Large-Scale Failures in IP Networks
Quickly recovering IP networks from failures is critical to enhancing Internet robustness and availability. Due to their serious impact on network routing, large-scale failures ...
Qiang Zheng, Guohong Cao, Tom La Porta, Ananthram ...