Sciweavers

1166 search results - page 19 / 234
» Crash Management for Distributed Parallel Systems
Sort
View
ICDCS
2012
IEEE
13 years 3 months ago
Tiresias: Online Anomaly Detection for Hierarchical Operational Network Data
Operational network data, management data such as customer care call logs and equipment system logs, is a very important source of information for network operators to detect prob...
Chi-Yao Hong, Matthew Caesar, Nick G. Duffield, Ji...
PODC
2004
ACM
15 years 6 months ago
Asynchronous group key exchange with failures
Group key exchange protocols allow a group of servers communicating over an asynchronous network of point-to-point links to establish a common key, such that an adversary which fu...
Christian Cachin, Reto Strobl
DNA
2009
Springer
145views Bioinformatics» more  DNA 2009»
15 years 7 months ago
Distributed Agreement in Tile Self-assembly
Abstract. Laboratory investigations have shown that a formal theory of fault-tolerance will be essential to harness nanoscale self-assembly as a medium of computation. Several rese...
Aaron Sterling
HPDC
2000
IEEE
15 years 5 months ago
A Monitoring Sensor Management System for Grid Environments
Large distributed systems such as Computational Grids require a large amount of monitoring data be collected for a variety of tasks such as fault detection, performance analysis, ...
Brian Tierney, Brian Crowley, Dan Gunter, Mason Ho...
SIGMOD
2007
ACM
158views Database» more  SIGMOD 2007»
16 years 1 months ago
Log-based recovery for middleware servers
We have developed new methods for log-based recovery for middleware servers which involve thread pooling, private inmemory states for clients, shared in-memory state and message i...
Rui Wang 0002, Betty Salzberg, David B. Lomet