Sciweavers

104 search results - page 20 / 21
» A Framework for Node-Level Fault Tolerance in Distributed Re...
IPPS
2006
IEEE
14 years 13 days ago
A self-stabilizing minimal dominating set algorithm with safe convergence
A self-stabilizing distributed system is a fault-tolerant distributed system that tolerates any kind and any finite number of transient faults, such as message loss and memory cor...
Hirotsugu Kakugawa, Toshimitsu Masuzawa
HPDC
2008
IEEE
14 years 26 days ago
DataLab: transactional data-parallel computing on an active storage cloud
Active storage clouds are an attractive platform for executing large data intensive workloads found in many fields of science. However, active storage presents new system managem...
Brandon Rich, Douglas Thain
HPDC
2012
IEEE
11 years 8 months ago
Understanding the effects and implications of compute node related failures in hadoop
Hadoop has become a critical component in today’s cloud environment. Ensuring good performance for Hadoop is paramount for the wide range of applications built on top of it. In ...
Florin Dinu, T. S. Eugene Ng
ICAS
2005
IEEE
14 years 13 hours ago
Analyzing the Impact of Components Replication in High Available J2EE Clusters
Clustering is a well-known technique that allows scalability and fault tolerance in distributed systems. In the J2EE framework, clustering can be used to improve the performance a...
Davide Rossi, Elisa Turrini
SIGMOD
2008
ACM
14 years 6 months ago
Paths to stardom: calibrating the potential of a peer-based data management system
As peer-to-peer (P2P) networks become more familiar to the database community, intense interest has built up in using their scalability and resilience properties to scale database...
Mihai Lupu, Beng Chin Ooi, Y. C. Tay