Sciweavers

JSSPP
2004
Springer
13 years 10 months ago
Performance Implications of Failures in Large-Scale Cluster Scheduling
As we continue to evolve into large-scale parallel systems, many of them employing hundreds of computing engines to take on mission-critical roles, it is crucial to design those s...
Yanyong Zhang, Mark S. Squillante, Anand Sivasubra...
JSSPP
2004
Springer
13 years 10 months ago
Workload Characteristics of a Multi-cluster Supercomputer
This paper presents a comprehensive characterization of a multi-cluster supercomputer1 workload using twelvemonth scientific research traces. Metrics that we characterize include...
Hui Li, David L. Groep, Lex Wolters
JSSPP
2004
Springer
13 years 10 months ago
Multi-toroidal Interconnects: Using Additional Communication Links to Improve Utilization of Parallel Computers
Three-dimensional torus is a common topology of network interconnects of multicomputers due to its simplicity and high scalability. A parallel job submitted to a three-dimensional...
Yariv Aridor, Tamar Domany, Oleg Goldshmidt, Edi S...
JSSPP
2004
Springer
13 years 10 months ago
Reconfigurable Gang Scheduling Algorithm
 Using a single traditional gang scheduling algorithm cannot provide the best performance for all workloads and parallel architectures. A solution for this problem is the use of...
Luís Fabrício Wanderley Góes,...
JSSPP
2004
Springer
13 years 10 months ago
Enhancements to the Decision Process of the Self-Tuning dynP Scheduler
The self-tuning dynP scheduler for modern cluster resource management systems switches between different basic scheduling policies dynamically during run time. This allows to reac...
Achim Streit
Distributed And Parallel Computing
Top of PageReset Settings