Search Sciweavers | Sciweavers

453 search results - page 16 / 91

» Fault-Tolerant Techniques for Ambient Intelligent Distribute...

118


HPDC
2000
IEEE

121views Distributed And Parallel Com...» more HPDC 2000»

Distributed Processor Allocation in Large PC Clusters

15 years 6 months ago

Download www.cs.hmc.edu

Current processor allocation techniques for highly parallel systems are based on centralized front-end based algorithms. As a result, the applied strategies are restricted to stat...

Hans-Ulrich Heiss, César A. F. De Rose, Phi...


110


ISPA
2004
Springer

146views Distributed And Parallel Com...» more ISPA 2004»

Highly Reliable Linux HPC Clusters: Self-Awareness Approach

15 years 7 months ago

Download xcr.cenit.latech.edu

Current solutions for fault-tolerance in HPC systems focus on dealing with the result of a failure. However, most are unable to handle runtime system configuration change...

Chokchai Leangsuksun, Tong Liu, Yudan Liu, Stephen...


124


SIGSOFT
2008
ACM

163views Software Engineering» more SIGSOFT 2008»

Experimenting with exception propagation mechanisms in service-oriented architecture

16 years 2 months ago

Download www.cs.ncl.ac.uk

Exception handling is one of the popular means used for improving dependability and supporting recovery in the Service-Oriented Architecture (SOA). This practical experience paper ...

Anatoliy Gorbenko, Alexander Romanovsky, Vyachesla...


117


SPAA
2010
ACM

161views Distributed And Parallel Com...» more SPAA 2010»

Securing every bit: authenticated broadcast in radio networks

15 years 6 months ago

Download infoscience.epfl.ch

This paper studies non-cryptographic authenticated broadcast in radio networks subject to malicious failures. We introduce two protocols that address this problem. The first, Nei...

Dan Alistarh, Seth Gilbert, Rachid Guerraoui, Zark...


167


ICDCS
2012
IEEE

238views Distributed And Parallel Com...» more ICDCS 2012»

Combining Partial Redundancy and Checkpointing for HPC

13 years 4 months ago

Download moss.csc.ncsu.edu

Today’s largest High Performance Computing (HPC) systems exceed one Petaflops (10^15 floating point operations per second) and exascale systems are projected within seven years...

James Elliott, Kishor Kharbas, David Fiala, Frank ...

