Sciweavers
Explore
Publications
Books
Software
Tutorials
Presentations
Lectures Notes
Datasets
Labs
Conferences
Community
Upcoming
Conferences
Top Ranked Papers
Most Viewed Conferences
Conferences by Acronym
Conferences by Subject
Conferences by Year
Tools
Sci2ools
International Keyboard
Graphical Social Symbols
CSS3 Style Generator
OCR
Web Page to Image
Web Page to PDF
Merge PDF
Split PDF
Latex Equation Editor
Extract Images from PDF
Convert JPEG to PS
Convert Latex to Word
Convert Word to PDF
Image Converter
PDF Converter
Community
Sciweavers
About
Terms of Use
Privacy Policy
Cookies
2
search results - page 1 / 1
»
DCR: A fully transparent checkpoint restart framework for di...
Sort
relevance
views
votes
recent
update
View
thumb
title
5
click to vote
CLUSTER
2009
IEEE
154
views
Distributed And Parallel Com...
»
more
CLUSTER 2009
»
DCR: A fully transparent checkpoint/restart framework for distributed systems
13 years 11 months ago
Download
www.cluster2009.org
Can Ma, Zhigang Huo, Jingnan Cai, Dan Meng
claim paper
Read More »
15
click to vote
IPPS
2007
IEEE
137
views
Distributed And Parallel Com...
»
more
IPPS 2007
»
The Design and Implementation of Checkpoint/Restart Process Fault Tolerance for Open MPI
13 years 11 months ago
Download
www.open-mpi.org
To be able to fully exploit ever larger computing platforms, modern HPC applications and system software must be able to tolerate inevitable faults. Historically, MPI implementati...
Joshua Hursey, Jeffrey M. Squyres, Timothy Mattox,...
claim paper
Read More »
« Prev
« First
page 1 / 1
Last »
Next »