Sciweavers

804 search results - page 2 / 161
» Measuring the Performance and Reliability of Production Comp...
Sort
View
CCGRID
2007
IEEE
13 years 11 months ago
Optimizing jobs timeouts on clusters and production grids
This paper presents a method to optimize the timeout value of computing jobs. It relies on a model of the job execution time that considers the job management system latency throu...
Tristan Glatard, Xavier Pennec
SP
2002
IEEE
134views Security Privacy» more  SP 2002»
13 years 5 months ago
Performance engineering, PSEs and the GRID
Performance Engineering is concerned with the reliable prediction and estimation of the performance of scientific and engineering applications on a variety of parallel and distrib...
Tony Hey, Juri Papay
SC
2009
ACM
14 years 3 days ago
FALCON: a system for reliable checkpoint recovery in shared grid environments
In Fine-Grained Cycle Sharing (FGCS) systems, machine owners voluntarily share their unused CPU cycles with guest jobs, as long as the performance degradation is tolerable. For gu...
Tanzima Zerin Islam, Saurabh Bagchi, Rudolf Eigenm...
ICIW
2009
IEEE
13 years 12 months ago
An Architecture for Reliable Mobile Workflow in a Grid Environment
— Mobile peer to peer (P2P) computing is becoming a major revolution in computing owing to advances in computing power, network connectivity and storage capacity of mobile device...
Bill Karakostas, George Fakas
METRICS
2005
IEEE
13 years 11 months ago
Measuring Productivity on High Performance Computers
In the high performance computing domain, the speed of execution of a program has typically been the primary performance metric. But productivity is also of concern to high perfor...
Marvin V. Zelkowitz, Victor R. Basili, Sima Asgari...