Sciweavers

146 search results - page 3 / 30
» Transparent Checkpoint-Restart of Distributed Applications o...
Sort
View
SIGMETRICS
2010
ACM
201views Hardware» more  SIGMETRICS 2010»
13 years 10 months ago
Transparent, lightweight application execution replay on commodity multiprocessor operating systems
We present S, the first system to provide transparent, lowoverhead application record-replay and the ability to go live from replayed execution. S i...
Oren Laadan, Nicolas Viennot, Jason Nieh
NSDI
2008
13 years 7 months ago
Maelstrom: Transparent Error Correction for Lambda Networks
The global network of datacenters is emerging as an important distributed systems paradigm -- commodity clusters running high-performance applications, connected by high-speed `la...
Mahesh Balakrishnan, Tudor Marian, Ken Birman, Hak...
IPPS
2007
IEEE
13 years 11 months ago
MultiEdge: An Edge-based Communication Subsystem for Scalable Commodity Servers
At the core of contemporary high performance computer systems is the communication infrastructure. For this reason, there has been a lot of work on providing low-latency, high-ban...
Sven Karlsson, Stavros Passas, George Kotsis, Ange...
VR
2002
IEEE
13 years 10 months ago
Net Juggler: Running VR Juggler with Multiple Displays on a Commodity Component Cluster
Net Juggler is an open source library that turns a commodity component cluster running the VR Juggler platform on each node into a single VR Juggler image cluster. Application par...
Jérémie Allard, Valérie Goura...
ICDCS
2012
IEEE
11 years 7 months ago
Combining Partial Redundancy and Checkpointing for HPC
Today’s largest High Performance Computing (HPC) systems exceed one Petaflops (1015 floating point operations per second) and exascale systems are projected within seven years...
James Elliott, Kishor Kharbas, David Fiala, Frank ...