Search Sciweavers | Sciweavers

CCGRID
2006
IEEE

Proposal of MPI Operation Level Checkpoint/Rollback and One Implementation

13 years 11 months ago

With the increasing number of processors in modern HPC (High Performance Computing) systems, there are two emergent problems to solve. One is scalability, the other is fault tolera...

Yuan Tang, Graham E. Fagg, Jack Dongarra

ICDCS
2012
IEEE

Combining Partial Redundancy and Checkpointing for HPC

11 years 8 months ago

Download moss.csc.ncsu.edu

Today’s largest High Performance Computing (HPC) systems exceed one Petaflops (10^15 floating point operations per second) and exascale systems are projected within seven years...

James Elliott, Kishor Kharbas, David Fiala, Frank ...

SBACPAD
2008
IEEE

Measuring Operating System Overhead on CMT Processors

13 years 12 months ago

Download people.ac.upc.edu

Numerous studies have shown that Operating System (OS) noise is one of the reasons for significant performance degradation in clustered architectures. Although many studies exami...

Petar Radojkovic, Vladimir Cakarevic, Javier Verd&...

CLUSTER
2007
IEEE

A feasibility analysis of power-awareness and energy minimization in modern interconnects for high-performance computing

13 years 9 months ago

Download post.queensu.ca

High-performance computing (HPC) systems consume a significant amount of power, resulting in high operational costs, reduced reliability, and wasting of natural resources. Therefor...

Reza Zamani, Ahmad Afsahi, Ying Qian, V. Carl Hama...

SPE
2010

A survey of the research on power management techniques for high-performance systems

13 years 3 months ago

Download cms.brookes.ac.uk

This paper surveys the research on power management techniques for high performance systems. These include both commercial high performance clusters and scientific high performanc...

Yongpeng Liu, Hong Zhu
