Sciweavers

2226 search results - page 365 / 446
» Fault-Tolerant Parallel Applications with Dynamic Parallel S...
CLUSTER
2002
IEEE
14 years 9 months ago
PHOENIX: A Self Adaptable Monitoring Platform for Cluster Management
Distributed systems based on clusters of workstations are more and more difficult to manage due to the increasing number of processors involved and the complexity of associated appl...
Céline Boutrous-Saab, Xavier Bonnaire, Bert...
SC
2004
ACM
15 years 3 months ago
Performance Tool Support for MPI-2 on Linux
Programmers of message-passing codes for clusters of workstations face a daunting challenge in understanding the performance bottlenecks of their applications. This is largely due...
Kathryn Mohror, Karen L. Karavanic
ASPLOS
2006
ACM
15 years 1 month ago
Accurate and efficient filtering for the Intel thread checker race detector
Debugging data races in parallel applications is a difficult task. Error-causing data races may appear to vanish due to changes in an application's optimization level, thread...
Paul Sack, Brian E. Bliss, Zhiqiang Ma, Paul Peter...
ICS
2005
Tsinghua U.
15 years 3 months ago
Continuous Replica Placement schemes in distributed systems
The Replica Placement Problem (RPP) aims at creating a set of duplicated data objects across the nodes of a distributed system in order to optimize certain criteria. Typically, RP...
Thanasis Loukopoulos, Petros Lampsas, Ishfaq Ahmad
ICDCS
1991
IEEE
15 years 1 month ago
Supporting the development of network programs
... of “network computers” is inherently less predictable than that of more traditional distributed memory systems, such as hypercubes [22], since both the...
Bernd Bruegge, Peter Steenkiste