Sciweavers

260 search results - page 13 / 52
» Reliable fault-tolerant sensors for distributed systems
Sort
View
CCGRID
2006
IEEE
15 years 5 months ago
MPI-Mitten: Enabling Migration Technology in MPI
Group communications are commonly used in parallel and distributed environment. However, existing migration mechanisms do not support group communications. This weakness prevents ...
Cong Du, Xian-He Sun
ISPA
2004
Springer
15 years 5 months ago
Highly Reliable Linux HPC Clusters: Self-Awareness Approach
Abstract. Current solutions for fault-tolerance in HPC systems focus on dealing with the result of a failure. However, most are unable to handle runtime system configuration change...
Chokchai Leangsuksun, Tong Liu, Yudan Liu, Stephen...
CASES
2009
ACM
15 years 6 months ago
Towards scalable reliability frameworks for error prone CMPs
As technology scales and the energy of computation continually approaches thermal equilibrium [1,2], parameter variations and noise levels will lead to larger error rates at vario...
Joseph Sloan, Rakesh Kumar
ISCIS
2004
Springer
15 years 5 months ago
Mutation-Like Oriented Diversity for Dependability Improvement: A Distributed System Case Study
Abstract. Achieving higher levels of dependability is a goal in any software project, therefore strategies for software reliability improvement are very attractive. This work intro...
Daniel O. Bortolas, Avelino F. Zorzo, Eduardo A. B...
SAC
2005
ACM
15 years 5 months ago
Efficient placement and routing in grid-based networks
This paper presents an efficient technique for placement and routing of sensors/actuators and processing units in a grid network. Our system requires an extremely high level of ro...
Roozbeh Jafari, Foad Dabiri, Bo-Kyung Choi, Majid ...