Sciweavers

554
Voted
SC
2015
ACM
9 years 8 months ago
BAD-check: bulk asynchronous distributed checkpointing
Leadership-scale scientific simulations running as tens of thousands of tightly-coupled MPI processes are vulnerable to interruption due to a single process or node failure. Due ...
John Bent, Brad Settlemyer, Haiyun Bao, Sorin Faib...
SC
2015
ACM
9 years 8 months ago
A practical approach to reconciling availability, performance, and capacity in provisioning extreme-scale storage systems
The increasing data demands from high-performance computing applications significantly accelerate the capacity, capability and reliability requirements of storage systems. As sys...
SC
2015
ACM
9 years 8 months ago
HipMer: an extreme-scale de novo genome assembler
De novo whole genome assembly reconstructs genomic sequences from short, overlapping, and potentially erroneous DNA segments and is one of the most important computations in moder...
SC
2015
ACM
9 years 8 months ago
Multi-objective job placement in clusters
One of the key decisions made by both MapReduce and HPC cluster management frameworks is the placement of jobs within a cluster. To make this decision, they consider factors like ...
Applied Computing
Top of PageReset Settings