Sciweavers

» Crash Management for Distributed Parallel Systems
ICDCS
2012
IEEE
13 years 2 months ago
Tiresias: Online Anomaly Detection for Hierarchical Operational Network Data
Operational network data, management data such as customer care call logs and equipment system logs, is a very important source of information for network operators to detect prob...
Chi-Yao Hong, Matthew Caesar, Nick G. Duffield, Ji...
PODC
2004
ACM
15 years 5 months ago
Asynchronous group key exchange with failures
Group key exchange protocols allow a group of servers communicating over an asynchronous network of point-to-point links to establish a common key, such that an adversary which fu...
Christian Cachin, Reto Strobl
DNA
2009
Springer
15 years 6 months ago
Distributed Agreement in Tile Self-assembly
Laboratory investigations have shown that a formal theory of fault-tolerance will be essential to harness nanoscale self-assembly as a medium of computation. Several rese...
Aaron Sterling
HPDC
2000
IEEE
15 years 4 months ago
A Monitoring Sensor Management System for Grid Environments
Large distributed systems such as Computational Grids require a large amount of monitoring data be collected for a variety of tasks such as fault detection, performance analysis, ...
Brian Tierney, Brian Crowley, Dan Gunter, Mason Ho...
SIGMOD
2007
ACM
15 years 12 months ago
Log-based recovery for middleware servers
We have developed new methods for log-based recovery for middleware servers which involve thread pooling, private in-memory states for clients, shared in-memory state and message i...
Rui Wang 0002, Betty Salzberg, David B. Lomet