Sciweavers

978 search results - page 9 / 196
» A performance study of job management systems
Sort
View
JSSPP
2009
Springer
15 years 4 months ago
Analyzing the EGEE Production Grid Workload: Application to Jobs Submission Optimization
Abstract. Grids reliability remains an order of magnitude below clusters on production infrastructures. This work is aimsed at improving grid application performances by improving ...
Diane Lingrand, Johan Montagnat, Janusz Martyniak,...
DSOM
2004
Springer
15 years 2 months ago
Failure Recovery in Distributed Environments with Advance Reservation Management Systems
Resource reservations in advance are a mature concept for the allocation of various resources, particularly in grid environments. Common grid toolkits such as Globus support advanc...
Lars-Olof Burchard, Barry Linnert
IPPS
2005
IEEE
15 years 3 months ago
Performance Implications of Periodic Checkpointing on Large-Scale Cluster Systems
Large-scale systems like BlueGene/L are susceptible to a number of software and hardware failures that can affect system performance. Periodic application checkpointing is a commo...
Adam J. Oliner, Ramendra K. Sahoo, José E. ...
HPDC
2003
IEEE
15 years 2 months ago
Trace-Based Simulations of Processor Co-Allocation Policies in Multiclusters
In systems consisting of multiple clusters of processors which employ space sharing for scheduling jobs, such as our Distributed ASCI1 Supercomputer (DAS), coallocation, i.e., the...
Anca I. D. Bucur, Dick H. J. Epema
GRID
2007
Springer
14 years 9 months ago
A Tool for Prioritizing DAGMan Jobs and its Evaluation
It is often difficult to perform efficiently a collection of jobs with complex job dependencies due to temporal unpredictability of the grid. One way to mitigate the unpredictabili...
Grzegorz Malewicz, Ian T. Foster, Arnold L. Rosenb...