Sciweavers

1166 search results - page 2 / 234
» Crash Management for Distributed Parallel Systems
Sort
View
VEE
2010
ACM
238views Virtualization» more  VEE 2010»
13 years 10 months ago
Optimizing crash dump in virtualized environments
Crash dump, or core dump is the typical way to save memory image on system crash for future offline debugging and analysis. However, for typical server machines with likely abund...
Yijian Huang, Haibo Chen, Binyu Zang
SRDS
1991
IEEE
13 years 8 months ago
A Fault-Tolerant, Scalable, Low-Overhead Distributed Garbage Detection Protocol
We present a protocol for the distributed detection of garbage in a distributed system subject to common failures such as lost and duplicated messages, network partition, dismount...
Marc Shapiro
EUROPAR
2007
Springer
13 years 11 months ago
On Detecting Termination in the Crash-Recovery Model
We investigate the problem of detecting termination of a distributed computation in an asynchronous message-passing system where processes may crash and recover. We show that it is...
Felix C. Freiling, Matthias Majuntke, Neeraj Mitta...
IPPS
2009
IEEE
13 years 12 months ago
Crash fault detection in celerating environments
Failure detectors are a service that provides (approximate) information about process crashes in a distributed system. The well-known “eventually perfect” failure detector, 3P...
Srikanth Sastry, Scott M. Pike, Jennifer L. Welch
COLCOM
2009
IEEE
13 years 9 months ago
An IT appliance for remote collaborative review of mechanisms of injury to children in motor vehicle crashes
This paper describes the architecture and implementation of a Java-based appliance for collaborative review of crashes involving injured children in order to determine mechanisms o...
Mahendra Kumar, Richard E. Newman, José For...