Sciweavers

1166 search results - page 53 / 234
» Crash Management for Distributed Parallel Systems
IPPS
2006
IEEE
15 years 5 months ago
Easy and reliable cluster management: the self-management experience of Fire Phoenix
High-Performance clusters are rapidly becoming an important computing platform for both scientific and business applications. To fulfill the new demands and challenges, cluster sy...
Zhihong Zhang, Dan Meng, Jianfeng Zhan, Lei Wang, ...
IPPS
1998
IEEE
15 years 4 months ago
Efficient Runtime Thread Management for the Nano-Threads Programming Model
Abstract. The nano-threads programming model was proposed to effectively integrate multiprogramming on shared-memory multiprocessors, with the exploitation of fine-grain parallelis...
Dimitrios S. Nikolopoulos, Eleftherios D. Polychro...
PPAM
2007
Springer
15 years 6 months ago
Using HLA and Grid for Distributed Multiscale Simulations
Combining simulations of different scales in one application is a non-trivial issue. This paper proposes a solution that supports complex time interactions that can appear between elem...
Katarzyna Rycerz, Marian Bubak, Peter M. A. Sloot
ICA3PP
2005
Springer
15 years 5 months ago
Mining Traces of Large Scale Systems
Abstract— Large scale distributed computing infrastructure captures the use of a high number of nodes, poor communication performance and continuously varying resources that are not...
Christophe Cérin, Michel Koskas
IPPS
2007
IEEE
15 years 6 months ago
A Flexible Resource Management Architecture for the Blue Gene/P Supercomputer
Blue Gene®/P is a massively parallel supercomputer intended as the successor to Blue Gene/L. It leverages much of the existing architecture of its predecessor to provide scalabi...
Sam Miller, Mark Megerian, Paul Allen, Tom Budnik