Search Sciweavers | Sciweavers

647 search results - page 1 / 130

» Simulating Failures on Large-Scale Systems

ICPPW
2008
IEEE

93 views | Distributed And Parallel Com...

Simulating Failures on Large-Scale Systems

15 years 10 months ago

Download www.mcs.anl.gov

Developing fault management mechanisms is a difficult task because of the unpredictable nature of failures. In this paper, we present a fault simulation framework for Blue Gene...

Narayan Desai, Ewing L. Lusk, Daniel Buettner, And...

CCGRID
2006
IEEE

130 views | Distributed And Parallel Com...

A Failure-Aware Scheduling Strategy in Large-Scale Cluster System

15 years 10 months ago

Download www.ncic.ac.cn

As the scale is expanding, node failure becomes a commonplace feature of large-scale cluster systems. As an important part of cluster operating system software, job scheduling tak...

Linping Wu, Dan Meng, Jianfeng Zhan, Wang Lei, Bib...

IPPS
2005
IEEE

132 views | Distributed And Parallel Com...

Performance Implications of Periodic Checkpointing on Large-Scale Cluster Systems

15 years 9 months ago

Download adam.oliner.net

Large-scale systems like BlueGene/L are susceptible to a number of software and hardware failures that can affect system performance. Periodic application checkpointing is a commo...

Adam J. Oliner, Ramendra K. Sahoo, José E. ...

DBKDA
2010
IEEE

127 views | Database

Failure-Tolerant Transaction Routing at Large Scale

15 years 2 months ago

Download www-poleia.lip6.fr

Emerging Web 2.0 applications such as virtual worlds or social networking websites strongly differ from usual OLTP applications. First, the transactions are encapsulated in an AP...

Idrissa Sarr, Hubert Naacke, Stéphane Gan&c...

MASCOTS
2001

167 views | Modeling And Simulation

Large-Scale Simulation of Replica Placement Algorithms for a Serverless Distributed File System

15 years 5 months ago

Download research.microsoft.com

Farsite is a scalable, distributed file system that logically functions as a centralized file server but that is physically implemented on a set of client desktop computers. Farsi...

John R. Douceur, Roger Wattenhofer
