Sciweavers

11 search results - page 1 / 3
» Selective Recovery from Failures in a Task Parallel Programm...
Sort
View
CCGRID
2010
IEEE
13 years 6 months ago
Selective Recovery from Failures in a Task Parallel Programming Model
Abstract--We present a fault tolerant task pool execution environment that is capable of performing fine-grain selective restart using a lightweight, distributed task completion tr...
James Dinan, Arjun Singri, P. Sadayappan, Sriram K...
SRDS
2008
IEEE
13 years 11 months ago
Application-Level Recovery Mechanisms for Context-Aware Pervasive Computing
We identify here various kinds of failure conditions and robustness issues that arise in context-aware pervasive computing applications. Such conditions are related to failures in...
Devdatta Kulkarni, Anand Tripathi
CCGRID
2009
IEEE
13 years 8 months ago
Failure-Aware Construction and Reconfiguration of Distributed Virtual Machines for High Availability Computing
In large-scale clusters and computational grids, component failures become norms instead of exceptions. Failure occurrence as well as its impact on system performance and operatio...
Song Fu
IWOMP
2009
Springer
13 years 11 months ago
A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures
Abstract. OpenMP has evolved recently towards expressing unstructured parallelism, targeting the parallelization of a broader range of applications in the current multicore era. Ho...
Eduard Ayguadé, Rosa M. Badia, Daniel Cabre...
HPDC
2010
IEEE
13 years 6 months ago
A data transfer framework for large-scale science experiments
Modern scientific experiments can generate hundreds of gigabytes to terabytes or even petabytes of data that may furthermore be maintained in large numbers of relatively small fil...
Wantao Liu, Brian Tieman, Rajkumar Kettimuthu, Ian...