Sciweavers

482 search results - page 83 / 97
» A large-scale study of failures in high-performance computin...
Sort
View
118
Voted
ICS
2007
Tsinghua U.
15 years 5 months ago
Proactive fault tolerance for HPC with Xen virtualization
Large-scale parallel computing is relying increasingly on clusters with thousands of processors. At such large counts of compute nodes, faults are becoming common place. Current t...
Arun Babu Nagarajan, Frank Mueller, Christian Enge...
DSN
2002
IEEE
15 years 4 months ago
An Automated Approach to Increasing the Robustness of C Libraries
As our reliance on computers increases, so does the need for robust software. Previous studies have shown that many C libraries exhibit robustness problems due to exceptional inpu...
Christof Fetzer, Zhen Xiao
102
Voted
ICDCS
2005
IEEE
15 years 5 months ago
FraNtiC: A Fractal Geometric Framework for Mesh-Based Wireless Access Networks
The design of the access networks of next generation broadband wireless systems requires special attention in the light of changing network characteristics. In this paper, we pres...
Samik Ghosh, Kalyan Basu, Sajal K. Das
94
Voted
MSS
2007
IEEE
105views Hardware» more  MSS 2007»
15 years 6 months ago
Quota enforcement for high-performance distributed storage systems
Storage systems manage quotas to ensure that no one user can use more than their share of storage, and that each user gets the storage they need. This is difficult for large, dis...
Kristal T. Pollack, Darrell D. E. Long, Richard A....
IPPS
2002
IEEE
15 years 4 months ago
Performance Prediction Technology for Agent-Based Resource Management in Grid Environments
Resource management constitutes an important infrastructural component of a computational grid environment. The aim of grid resource management is to efficiently schedule applicat...
Junwei Cao, Stephen A. Jarvis, Daniel P. Spooner, ...