Search Sciweavers | Sciweavers

482 search results - page 1 / 97

» A large-scale study of failures in high-performance computin...

click to vote

ICPP
2008
IEEE

152views Distributed And Parallel Com...» more ICPP 2008»

Dynamic Meta-Learning for Failure Prediction in Large-Scale Systems: A Case Study

13 years 11 months ago

Download www.cs.iit.edu

Despite great efforts on the design of ultra-reliable components, the increase of system size and complexity has outpaced the improvement of component reliability. As a result, fa...

Jiexing Gu, Ziming Zheng, Zhiling Lan, John White,...

claim paper

Read More »

click to vote

WEA
2005
Springer

176views Algorithms» more WEA 2005»

High-Performance Algorithm Engineering for Large-Scale Graph Problems and Computational Biology

13 years 10 months ago

Download www.cc.gatech.edu

Abstract. Many large-scale optimization problems rely on graph theoretic solutions; yet high-performance computing has traditionally focused on regular applications with high degre...

David A. Bader

claim paper

Read More »

click to vote

HIPC
2000
Springer

149views Distributed And Parallel Com...» more HIPC 2000»

Meta-data Management System for High-Performance Large-Scale Scientific Data Access

13 years 8 months ago

Download www.eecs.northwestern.edu

Many scientific applications manipulate large amount of data and, therefore, are parallelized on high-performance computing systems to take advantage of their computational power a...

Wei-keng Liao, Xiaohui Shen, Alok N. Choudhary

claim paper

Read More »

click to vote

DSN
2006
IEEE

231views Computer Networks» more DSN 2006»

Improving BGP Convergence Delay for Large-Scale Failures

13 years 11 months ago

Download www.cs.ucdavis.edu

Border Gateway Protocol (BGP) is the standard routing protocol used in the Internet for routing packets between the Autonomous Systems (ASes). It is known that BGP can take hundre...

Amit Sahoo, Krishna Kant, Prasant Mohapatra

claim paper

Read More »

click to vote

IPPS
2005
IEEE

132views Distributed And Parallel Com...» more IPPS 2005»

Performance Implications of Periodic Checkpointing on Large-Scale Cluster Systems

13 years 10 months ago

Download adam.oliner.net

Large-scale systems like BlueGene/L are susceptible to a number of software and hardware failures that can affect system performance. Periodic application checkpointing is a commo...

Adam J. Oliner, Ramendra K. Sahoo, José E. ...

claim paper

Read More »

« Prev « First page 1 / 97 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers