Sciweavers

23557 search results - page 489 / 4712
» Distributed Computing - Introduction
Sort
View
CORR
2008
Springer
134views Education» more  CORR 2008»
15 years 5 months ago
Algorithmic Based Fault Tolerance Applied to High Performance Computing
: We present a new approach to fault tolerance for High Performance Computing system. Our approach is based on a careful adaptation of the Algorithmic Based Fault Tolerance techniq...
George Bosilca, Remi Delmas, Jack Dongarra, Julien...
140
Voted
CLUSTER
2002
IEEE
15 years 4 months ago
Condor-G: A Computation Management Agent for Multi-Institutional Grids
In recent years, there has been a dramatic increase in the amount of available computing and storage resources. Yet few have been able to exploit these resources in an aggregated ...
James Frey, Todd Tannenbaum, Miron Livny, Ian T. F...
IPPS
2010
IEEE
15 years 2 months ago
An auto-tuning framework for parallel multicore stencil computations
Although stencil auto-tuning has shown tremendous potential in effectively utilizing architectural resources, it has hitherto been limited to single kernel instantiations; in addi...
Shoaib Kamil, Cy Chan, Leonid Oliker, John Shalf, ...

Publication
1286views
17 years 3 months ago
A Quantitative Measure Of Fairness And Discrimination For Resource Allocation In Shared Computer Systems
Fairness is an important performance criterion in all resource allocation schemes, including those in distributed computer systems. However, it is often specified only qualitativel...
R. Jain, D. Chiu, and W. Hawe
IPPS
2007
IEEE
15 years 11 months ago
Automatic Performance Diagnosis of Parallel Computations with Compositional Models
Performance tuning involves a diagnostic process to locate and explain sources of program inefficiency. A performance diagnosis system can leverage knowledge of performance cause...
Li Li, Allen D. Malony